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(57) Abstract: Novel CARD-3. CARD^L, CARD-4S. CARD-4Y. CARD^Z. CARD-5. and CARD-6 polypeptides, proteins, 
and nucleic acid molecules are disclosed. In addition to isolated CARD-3. CARD-4L. CARD^S, CARD^Y, CARD^Z. 
CARD-5. and CARD-6 proteins, and the invention further provides CARD-3, CARD-4L, CARD-4S, CARD^Y, CARD-4Z, 
CARD-5. and CARD-6 fusion proteins, antigenic peptides and anti-CARDS-3. anti-CARD-4L and ami-CARD-lS. anti-CARD-4Y. 
anti-CARD-*Z. ami-CARDS-5. and anti-CARD-6 antibodies. The invention also provides CARD-3. CARXML. CARD-IS. 
CARD-4Y, CARD-tZ. CARD-5, and CARD-6 nucleic acid molecules, recombinant expression vectors containing a nucleic acid 
molecule of the invention, host cells into which the expression vectors have been introduced and non-human transgenic animals in 
which a CARD-3. CARD-4L. CARD-4S. CARD-4Y. CARD-4Z. CARD-5. and CARD-6 gene has been introduced or disrupted. 
Diagnostic, screening and therapeutic methods utilizing compositions oi the invention are also provided. 
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NOVEL MOLECULES OF THE CARD-RELATED 
PROTEIN FAMILY AND USES THEREOF 

Background of the Invention 
5 In multicellular organisms, homeostasis is maintained by balancing 

the rate of cell proliferation against the rate of cell death. Cell proliferation is 
influenced by numerous growth factors and the expression of proto-oncogenes. 
which typically encourage progression through the cell cycle. In contrast, 
numerous events, including the expression of tumor suppressor genes, can lead to 

10 an arrest of cellular proliferation. 

In differentiated cells, a particular type of cell death called apoptosi-s 
occurs when an internal suicide program is activated. This program can be 
initiated by a variety of external signals as well as signals that are generated 
within the cell in response to. lor example, genetic damage. For many years, the 

15 magnitude of apoptotic cell death was not appreciated because the dying cells arc 
quickly eliminated by phagocytes, without an inflammatory response. 

The mechanisms that mediate apoptosis have been intensively studied. 
These mechanisms involve the activation of endogenous proteases, loss of 
mitochondrial function, and structural changes such as disruption of the 

2 0 cytoskeleton. cell shrinkage, membrane blebbing. and nuclear condensation due 
to degradation of DNA. The various signals that trigger apoptosis are thought to 
bring about these events by converging on a common cell death pathway that is 
regulated by the expression of genes that are highly conserved from worms, such 
as C. eleguns. to humans. In fact, invertebrate model systems have been 

2 5 invaluable tools in identifying and characterizing the genes thai control apoptosis. 

Through the study of invertebrates and more evolved animals, numerous genes 
that are associated with cell death have been identified, but the way in which their 
products interact to execute the apoptotic program is poorly understood. 

Caspases. a class of proteins central to the apoptotic program, are 

3 0 responsible for the degradation of cellular proteins that leads to the 

morphological changes seen in cells undergoing apoptosis. Caspases are cysteine 
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proteases having specificity for aspartate at the substrate cleavage site. An 
effector caspase is activated by an initiator caspase which cleaves the effector 
caspase at specific internal aspartate residues resulting in the separation of the 
large and small subunits of the effector caspase. For example, one of the 
5 caspases identified in humans was previously known as the interleukin-la (IL- 
1 a) converting enzyme (ICE), a cysteine protease responsible for the processing 
of pro-IL- 1 a to the active cytokine. Overexpression of ICE in Rat- 1 fibroblasts 
induces apoptosis (Miura et aL Cell 75:653. 1993). 

Many caspases and proteins that interact with caspases possess 
1 0 domains of about 60 amino acids called a caspase recruitment domain (CARD). 
Hofmann et al. (TIBS 22:155, 1997) and others have postulated that certain 
apoptotic proteins bind to each other via their CARDs and that different subtypes 
of CARDs may confer binding specificity, regulating the activity of various 
caspases. for example. The functional significance of CARDs have been 
15 repeatedly demonstrated. For example. Duan et al. (Mature 385:86, 1997) 
showed that deleting the CARD at the N-terminus of RAIDD abolished the 
ability of RAIDD to bind to caspases. 

Caspase-9 activation may precede the activation of all other cell 
death-related caspases in the mitochondrial pathways of apoptosis (See et al.. J. 
2 0 Cell Biol. 144:281-292. 1999). Inactive procaspase-9 is activated by interaction 
with a complex which includes Apaf-1. a CARD-containing protein, and other 
factors (Li et al.. Cell 91:479. 1997; Srinivasula et al.. Mol. Cell 1 :949-959, 
1 998). Recognition of procaspase-9 by Apaf-1 occurs primarily through the 
interaction of the CARD of Apaf-1 with the prodomain of caspase-9. The CARD 
2 5 of Apaf- 1 shares about 20% sequence identity with the prodomain of procaspase- 
9. The prodomain of caspase-9 is a member of the CARD family of apoptotic 
signalling motifs (Hofmann and Bucher. Trends in Biochem. Sci. 22:155-156. 
1997). A similar domain is present in caspase activating proteins CED-4 and 
RAIDD/CRADD as well as in initiator caspases CED-3 and caspase-2/lCI 1-1 
30 (Duan and Dixit. Nature 385:86-89. 1997: Ahmad et aL, Cancer Res. 57:615-619. 
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1997; Alnemri et al., Cell 87:171, 1996). Apaf-1 can bind several other caspases, 
e.g.. caspase-4 and caspase-8 (lnohara et al.. J. Biol. Chem. 273:12296-12300, 
1998). 

Nuclear factor-KB (NF-kB) is a transcription factor expressed in many 
5 cell types and which activates homologous or heterologous genes that have kB 
sites in their promoters. Molecules that regulate NF-kB activation play a critical 
role in both apoptosis and inflammation. Quiescent NF-kB resides in the 
cytoplasm as a heterodimer of proteins referred to as p50 and p65 and is 
complexcd with the regulatory protein IkB. NF-kB binding to IkB causes NF-kB 

10 to remain in the cytoplasm. At least two dozen stimuli that activate NF-kB are 
known (New England Journal of Medicine 336:1066. 1997 ) and they include 
cytokines, protein kinase C activators, oxidants, viruses, and immune system 
stimuli. NF-kB activating stimuli activate specific IkB kinases that 
phosphorylatc IkB leading to its degradation. Once liberated from IkB. NF-kB 

1 5 translocates to the nucleus and activates genes with kB sites in their promoters. 
The proinflammatory cytokines TNF-a and 1L-1 induce NF-kB activation by 
binding their cell-surface receptors and activating the NF-KB-inducing kinase. 
NIK. and NF-kB. NIK phosphorylates the IkB kinases a and p which 
phosphorylatc IkB, leading to its degradation. 

2 0 NF-kB and the NF-kB pathway has been implicated in mediating 

chronic inflammation in inflammatory diseases such as asthma, ulcerative colitis, 
rheumatoid arthritis (Epstein. New England Journal of Medicine 336:1066. 1097) 
and inhibiting NF-kB or NF-kB pathways may be an effective way of treating 
these diseases. NF-kB and the NF-kB pathway has also been implicated in 

2 5 atherosclerosis (Navab et al.. American Journal of Cardiology 76: ISC. 1095). 

especially in mediating fatty streak formation, and inhibiting NF-kB or NF-kB 
pathways may be an effective therapy for atherosclerosis. Among the genes 
activated by NF-kB are clAP-1 . cl AP-2. TRAF1. and TRAF2. all of which have 
been shown to protect cells from TNF-u induced cell death (Wang et al.. Science 

3 0 281:1680-83. 1998). CLAP, a protein which includes a CARD, activates the 
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Apaf-l-caspase-9 pathway and activates NF-kB by acting upstream of NIK and 
IkB kinase (Srinivasula et aL supra). 

Bcl-2 family proteins arc important regulators of pathways involved in 
apoptosis and can act to inhibit or promote cell death. Expression of certain ami- 
5 apoptotic Bcl-2 family members is commonly altered in cancerous cells, 
suppressing programmed cell death and extending tumor growth. Among the 
anti-apoptotic Bcl-2 family members thus far identified are Boo, Bcl-2. Bcl-x L , 
Bcl-w. NR-13. Al. and Mcl-2. Pro-apoptotic Bcl-2 family members include Bax, 
Bak, Bad, Bik. Bid. Hrk, Bim, and Bok/Mtd. Significantly, the anti-apoptotic 
l o Bcl-2 family member. Bcl-xi , has been shown to interact with Apaf-1 and block 
Apaf-1 -dependent caspase-9 activation (Hu et al., Proc. Natl Acad. Sci. 
95:4386-4391, 1998). Boo. another anti-apoptotic Bcl-2 family member, 
interacts with Apaf-1 and caspase-9. Bak and Bik. pro-apoptotic Bcl-2 family 
members, can disrupt the association of Boo with Apaf-1 (Song et al.. EK4BO J. 
15 18:167-178. 1999). Boo is thought to be involved in the control of ovarian 
atresia and sperm maturation. Diva, another member of the Bcl-2 family, 
inhibits binding of Bel -x 3 . to Apf-1, preventing Bcl-x L from binding to Apaf-1. 

Neurotrophic (e.g.. NGF). which are best known as neuronal survival 
factors, can mediate apoptosis via the p75 neurotrophin receptor (p75 N!K ). It is 
2 0 thought that p75 NTR activation can lead to NF-kB activation (Carter et al.. Science 
272:542-545. 1996). It has been proposed that p75 N1 "-mediated cell death acts to 
ensure rapid cell death when a neuron is unable to obtain sufficient neurotropic 
This mechanism could, for example, cause the elimination of neurons that reach 
an inappropriate target or that reach an appropriate target at an inappropriate time 
25 (Miller and Kaplan. Cell Death and Diff. 5:343-345. 1998). 
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Summary of the Invention 

The present invention is based, at least in part, on the discover}' of 
genes encoding CARD-3, CARD-4, CARD-5, and CARD-6. A full-length 
human CARD-3 cDNA is presented. Several CARD-4 cDNAs are presented. 
5 Briefly, the CARD-4 gene can express a long transcript thai encodes CARD-4L. 
a short transcriprthat encodes partial CARD-4S, or two CARD-4 splice variants 
(CARD-4 Y and CARD-4Z). A full length cDNA sequence for the murine 
ortholog of CARD-4L is also presented. Full-length cDNAs encoding murine 
and human CARD-5 are presented. In addition, full-length cDNAs encoding 

1 C human and rai CARD-6 are presented. 

CARD-3. CARD-4. CARD-5. and CARD-6 are intracellular proteins 
lhat arc predicted to be involved in regulating caspase activation. CARD-4 is 
found to activate the NF-kB pathway and to enhance caspase 9-mediated cell 
death. In addition, proteins that bind to CARD-4 are presented including CARD- 
15 3 and hNUDC. 

The CARD-3 cDNA described below (SEQ ID NO: 1) has a 1620 
open reading frame (nucleotides 2 1 4 to 1833 of SEQ ID NOT; SEQ ID NO:3) 
which encodes a 540 amino acid protein (SEQ ID NO:2). CARD-3 contains a 
kinase domain which extends from amino acid 1 to amino acid 300 of SEQ ID 

2 0 NO:2; SEQ ID NO:4. followed by a linker domain at amino acid 301 to amino 

acid 43 I of SEQ ID NO:2: SEQ ID NO:5 and a CARD at amino acid 432 to 
amino acid 540 of SEQ ID NO:2: SEQ ID NO:6. 

At least four forms of CARD-4 exist in the cell, a long form. CARD- 
41.. a short form. CARD-4S. and two splice variants. CARD-4Y and CARD-4Z. 

2 5 The cDNA of CARD-4 L described below {SEQ ID NO:7) has a 2859 nucleotide 

open reading frame (nucleotides 245-3103 of SEQ ID NO:7: SEQ ID NO:9) 
which encodes a 953 amino acid protein (SEQ ID NO:S). CARD-4L protein 
possesses a CARD domain (amino acids 15-1 14: SEQ ID NO: 10). The 
nucleotide sequence of the full length cDNA corresponding to the murine 

3 0 ortholog of human CARD-4L is presented (SEQ ID NO:42) as is the predicted 
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amino acid sequence of murine CARD-4L (SEQ ID NO:43). A comparison 
between the predicted amino acid sequences of human CARD-4L and murine 
CARD-4L is also depicted in Figure 17. 

Human CARD-4L is also predicted to have a nucleotide binding 
5 domain which extends from about amino acid 198 to about amino acid 397 of 
SEQ ID NO:8; SEQ ID NO: 1 1, a Walker Box "A", which extends from about 
amino acid 202 to about amino acid 209 of SEQ ID NO:8; SEQ ID NO:12, a 
Walker Box "B'\ which extends from about amino acid 280 to about amino acid 
284. of SEQ ID NO:8; SEQ ID NO: 13, a kinase la (P-loop) subdomain. which 
10 extends from about amino acid 127 to about amino acid 212 of SEQ ID NO:8: 
SEQ ID NO:46, a kinase 2 subdomain. which extends from about amino acid 273 
to about amino acid 288 of SEQ ID NO:8: SEQ ID NO:47. a kinase 3a 
subdomain. which extends from about amino acid 327 to about amino acid 338 of 
SEQ ID NO:8; SEQ ID NO: 14. and ten Leucine-rich repeats which extend from 
15 about amino acid 674 to about amino acid 950 of SEQ ID NO:8. The first 

Leucine-rich repeat extends from about amino acid 674 to about amino acid 701 
of SEQ ID NO:8: SEQ ID NO: 15. The second Leucine-rich repeat extends from 
about amino acid 702 to about amino acid 727 of SEQ ID NO:8: SEQ ID NO: 16. 
The third Leucine-rich repeat extends from about amino acid 728 to about amino 
2 0 acid 754 of SEQ ID NO:8: SEQ ID NO: 17. The fourth Leucine-rich repeat 
extends from about amino acid 755 to about amino acid 782 of SEQ ID NO:8: 
SEQ ID NO: 18. The fifth Leucine-rich repeat extends from about amino acid 
783 to about amino acid 810 of SEQ ID NO:8: SEQ ID NO: 19. The sixth 
Leucine-rich repeat extends from about amino acid 81 1 to about amino acid 838 

2 5 or SEQ ID NO:8: SEQ ID NO:20. The seventh Leucine-rich repeat extends from 

about amino acid 839 to about amino acid 866 of SEQ ID NO:8: SEQ ID NO:21 . 
The eighth Leucine-rich repeat extends from about amino acid 867 to about 
amino acid 894 of SEQ ID NO:8: SEQ ID NO:22. The ninth Leucine-rich repeal 
extends from about amino acid 895 to about amino acid 922 of SEQ ID NO:8: • 

3 0 SEQ ID NO:23 and the tenth leucine-rich repeat extends from about amino acid 

- 6 - 
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923 to about amino acid 950 of SEQ ID NO:8; SEQ ID NO:24. 

The partial cDNA of CARD-4S described below {SEQ ID NO:25) has 
a 1470 nucleotide open reading frame (nucleotides 1-1470 of SEQ ID NO:25; 
SEQ ID NO:27) which encodes a 490 amino acid protein (SEQ ID NO:26). 
5 CARD-4S protein possesses a CARD domain (amino acids 1 -74 of SEQ ID 

NO:26; SEQ ID NO:28). CARD-4S is predicted to have a P-Loop which extends 
from about amino acid 163 to about amino acid 1 70 of SEQ ID NO:26; SEQ ID 
NO:29. and a Walker Box which extends form about amino acid 241 to about 
amino acid 245 of SEQ ID NO:26: SEQ ID NO:30. 

1 0 A human CARD-4Y nucleotide cDNA sequence is presented (SEQ ID 

NO:38 ) as is the amino acid sequence of the predicted CARD-4Y product (SEQ 
ID NO:39). A human CARD-4Z nucleotide cDNA sequence is presented (SEQ 
ID NO:40) as is the amino acid sequence of the predicted CARD-4Z product 
(SEQ ID NO:41 ). A comparison of the CARD-4Y, CARD-4Z. and human 
15 CARD-4L predicted amino acid sequences is also shown in Figure 14. 

The 761 nucleotide murine CARD-5 cDNA described below (SEQ ID 
NO:60) has a 579 nucleotide open reading frame (nucleotides 89 to 668 Of SEQ 
ID NO:60; SEQ ID NO:62) which encodes a 193 amino acid protein (SEQ ID 
NO:61 ). Murine CARD-5 contains a CARD domain which extends from amino 

2 0 acid 1 1 0 to amino acid 1 79 of SEQ ID NO:6 1 (SEQ ID NO:66). 

The 740 nucleotide human CARD-5 cDNA described below (SEQ ID 
NO:48) has a 585 nucleotide open reading frame (nucleotides 54 to 639 of SEQ) 
ID NO:48: SEQ ID NO:50) which encodes a 195 amino acid protein (SEQ ID 
NO:49). Human CARD-5 contains a CARD domain which extends from amino 

2 5 acid 11 1 to amino acid 181 of SEQ ID NO:49 (SEQ ID NO:58). 

The 5252 nucleotide rat CARD-6 cDNA described below (SEQ) ID 
NO:51 ) has a 2715 nucleotide open reading frame (nucleotides 169 to 2883 of 
SEQ ID NO:51: SEQ ID NO:53) which encodes a 905 amino acid protein (SEQ 
ID NO:52i. Rai CARD-6 contains a CARD domain which extends from amino 

3 0 acid 1 to ammo acid 1 08 of SEQ ID NO:52 { SEQ ID NO:59). Rat CARD-G also 
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has a proline-rich C-terminus which extends from amino acid 698 to amino acid 
905 of SEQ ID NO:52 (SEQ ID NO:65). This proline-rich domain includes five 
putative SH3 binding sites. These binding sites have the sequence PXXP and are 
located at amino acids 710 to 713 (PAHP), 806 to 809 (PLRP), 819 to 822 
5 (PIPP). 857 to 860 (PPHP). and 881 to 884 (PSQP) of SEQ ID NO:52. 

The 4244 human CARD-6 cDNA described below (SEQ ID NO:54) 
has a 3 1 1 1 nucleotide open reading frame (nucleotides 200 to 33 1 0 of SEQ ID 
NO:54; SEQ ID NO:56) which encodes a 1037 amino acid protein (SEQ ID 
NO:55). Human CARD-6 includes a CARD domain which extends from amino 
10 acid 5 to amino acid 92 of SEQ ID NO:55 (SEQ ID NO:64). 

Like other proteins containing a CARD domain. CARD-3, CARD-4. 
CARD-5. and CARD-6 to participate in the network of interactions that lead to 
caspase activity. Human CARD-4L likely plays a functional role in caspase 
activation similar to that of Apaf-1 (Zou et al. (1997) Cell 90:405-413). For 
1 5 example, upon activation, CARD-4L binds a nucleotide, thus allowing CARD-4L 
to bind and activate a CARD-containing caspase via a CARD-CARD interaction, 
leading to apoptoiic death of the cell. CARD-3. CARD-4, CARD-5. and CARD- 
6 molecules arc useful as modulating agents in regulating a variety of cellular 
processes including cell growth and cell death. In one aspect, this invention 
2 0 provides isolated nucleic acid molecules encoding CARD-3. CARD-4, CARD-5. 
or CARD-6 proteins or biologically active portions thereof, as well as nucleic 
acid fragments suitable as primers or hybridization probes for the detection of 
CARD-3, CARD-4. CARD-5. or CARD-6 encoding nucleic acids. 

The invention encompasses methods of diagnosing and treating 

2 5 patients who are suffering from a disorder associated with an abnormal level or 

rate (undesirably high or undesirably low) of apoptotic cell death, abnormal 
activity of the Fas/APO-1 receptor complex, abnormal activity of the TNF 
receptor complex, or abnormal activity of a caspase by administering a 
compound that modulates the expression of CARD-3. CARD-4. CARD-5. or 

3 0 CARD-6 (at the DNA. mRKA or protein level, e.g.. by altering mRNA splicing) 
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or by altering the activity of CARD-3, CARD-4, CARD-5. or CARD-6. 
Examples of such compounds include small molecules, antisense nucleic acid 
molecules, ribozymes, and polypeptides. 

Certain disorders are associated with an increased number of 
5 surviving cells, which are produced and continue to survive or proliferate when 
apoptosis is inhibited or occurs at an undesirably low rate. Compounds that 
modulate the expression or activity of CARD-3, CARD-4, CARD-5, or CARD-6 
can be used to treat or diagnose such disorders. These disorders include cancer 
(particularly follicular lymphomas, chronic myelogenous leukemia, melanoma, 

10 colon cancer, lung carcinoma, carcinomas associated with mutations in p53. and 
hormone-dependent tumors such as breast cancer, prostate cancer, and ovarian 
cancer). Such compounds can also be used to treat viral infections (such as those 
caused by herpesviruses, poxviruses, and adenoviruses). Failure to remove 
autoimmune cells that arise during development or that develop as a result oi" 

15 somatic mutation during an immune response can result in autoimmune disease. * 
Thus, autoimmune disorders can be caused by an undesirably low levels of 
apoptosis. Accordingly, modulators of CARD-3. CARD-4. CARD-5. or CARD- 
6 activity or expression can be used to treat autoimmune disorders (e.g.. systemic 
lupus erythematosis. immune-mediated glomerulonephritis, and arthritis). 

2 0 Many diseases are associated with an undesirably high rate of 

apoptosis. Modulators of CARD-3. CARD04. CARD-5. or CARD-6 expression 
or activity can be used to treat or diagnose such disorders. For example, 
populations of cells are often depleted in the event of viral infection, with perhaps 
the most dramatic example being the cell depletion caused by the human 

2 5 immunodeficiency virus (HIV). Surprisingly, most T cells that die during HIV 

infections do not appear to be infected with HIV. Although a number of 
explanations have been proposed, recent evidence suggests that stimulation of the 
CD4 receptor results in the enhanced susceptibility of uninfected T cells to 
undergo apoptosis. A wide variety of neurological diseases are characterized by 

3 0 the gradual loss of specific sets of neurons. Such disorders include Alzheimer's 
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disease, Parkinson's disease, amyotrophic lateral sclerosis (ALS) retinitis 
pigmentosa, spinal muscular atrophy, and various forms of cerebellar 
degeneration. The cell loss in these diseases does not induce an inflammatory 
response, and apoptosis appears to be the mechanism of cell death. In addition, a 
5 number of hematologic diseases are associated with a decreased production of 
blood cells. These disorders include anemia associated with chronic disease, 
aplastic anemia, chronic neutropenia, and the myelodysplastic syndromes. 
Disorders of blood cell production, such as myelodysplastic syndrome and some 
forms of aplastic anemia, are associated with increased apoptotic cell death 
1 0 within the bone marrow. These disorders could result from the activation of 
genes that promote apoptosis. acquired deficiencies in stromal cells or 
hematopoietic survival factors, or the direct effects of toxins and mediators of 
immune responses. Two common disorders associated with cell death are 
myocardial infarctions and stroke. In both disorders, cells within the central area. 
15 of ischemia, which is produced in the event of acute loss of blood flow, appear to 
die rapidlv as a result of necrosis. However, outside the central ischemic zone, 
cells die over a more protracted time period and morphologically appear to die by 
apoptosis. 

Proteins containing a CARD domain are thought to be involved in 
2 0 various inflammatory disorders. Accordingly. CARD-3. C ARD-4. C ARD-5, and 
CARD-6 polypeptides, nucleic acids and modulators of CARD-3. CARD-4. 
CARD-5. or CARD-6 expression or activity can be used to treat immune 
disorders. Such immune disorders include, but are not limited to. chronic 
inflammatory diseases and disorders, such as Crohn's disease, reactive arthritis. 

2 5 including Lyme disease, insulin-dependent diabetes, organ-specific 

autoimmunity, including multiple sclerosis. Hashimoto's thyroiditis and Grave's 
disease, contact dermatitis, psoriasis, graft rejection, graft versus host disease, 
sarcoidosis, atopic conditions, such as asthma and allergy, including allergic 
rhinitis, gastrointestinal allergies, including food allergies, eosinophilia. 

3 0 conjunctivitis, glomerular nephritis, certain pathogen susceptibilities such as 
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helminthic (e.g., leishmaniasis), certain viral infections, including HIV. and 
bacterial infections, including tuberculosis and lcpromatous leprosy. 

In addition to the aforementioned disorders, CARD-3, CARD-4, 
CARD-5. and CARD-6 polypeptides, nucleic acids, and modulators of CARD-3, 
5 C ARD-4, CARD-5 or CARD-6 expression or activity can be used to treat 

disorders of cell signaling and disorders of tissues in which CARD-3. CARD-4, 
CARD-5 or CARD-6 is expressed. 

The invention features a nucleic acid molecule which is at least 45% 
(or 55%. 65%, 75%. 85%, 95%, or 98%) identical to the nucleotide sequence 

10 shown in SEQ ID NO:L SEQ ID NO:3. SEQ ID NO:7, SEQ ID NO:9, SEQ 

ID:25. SEQ ID NO:27. SEQ ID NO:38, SEQ ID NO:40. SEQ ID NO:42. SEQ ID 
NO:48. SEQ ID NO:50. SEQ ID NO:51 . SEQ ID NO:53, SEQ ID NO:54, SEQ 
ID NO:56. SEQ ID NO:60. SEQ ID NO:62. the nucleotide sequence of the 
cDNA insert of the plasmid deposited with the ATCC as Accession Number 

15 203037 (the "cDNA of ATCC 203037"). the nucleotide sequence of the cDNA 
insert of the plasmid deposited with the ATCC as Accession Number 203035 (the 
"cDNA of ATCC 203035"), the nucleotide sequence of the cDNA insert -of the 
plasmid deposited with the ATCC as Accession Number 203036 (the "cDNA of 
ATCC 203036"), the nucleotide sequence of the cDNA insert of the plasmid 

2 0 deposited with the ATCC as Accession Number PTA-21 1 (the "cDNA of ATCC 
PTA-21 1"). the nucleotide sequence of the cDNA insert of the plasmid deposited 
with the ATCC as Accession Number PTA-21 2 ("the cDNA of ATCC PTA- 
21 2"). the nucleotide sequence of the cDNA insert of the plasmid deposited with 
the ATCC as Accession Number PTA-21 3 (the "cDNA of ATCC PTA-21 3"). or 

2 5 a complement thereof. 

The invention features a nucleic acid molecule which includes a 
fragment of at least 150 (300. 325. 350. 375. 400. 425. 450. 500. 550. 600. 650. 
700. 800. 900. 1 000. 1 300. 1 600 or 1 93 1 ) nucleotides of the nucleotide sequence 
shown in SEQ ID NO: 1 . or SEQ ID NO:3. or the nucleotide sequence of the 

3 0 cDNA ATCC 203037. or a complement thereof. 
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The invention also features a nucleic acid molecule which includes a 
fragment of at least 150 (350, 400, 450, 500. 550, 600, 650, 700, 800. 900, 1 000, 
1300. 1600, 1900. 2100, 2400. 2700, 3000. or 3382) nucleotides of the nucleotide 
sequence shown in SEQ ID NO:7, SEQ ID NO:9, or the nucleotide sequence of 
5 the cDN A ATCC 203035, or a complement thereof. 

Also within the invention is a nucleic acid molecule which includes a 
fragment of at least 150 (350. 400, 450, 500, 550. 600. 650. 700. 800. 900, 1000. 
1300. 1600. 1900. 2 100, 2400. 2700, and 3080) nucleotides of the nucleotide 
sequence shown in SEQ ID NO:25, SEQ ID NO:27. SEQ ID NO:38. SEQ ID 
10 NO:40. or the nucleotide sequence of the cDNA of ATCC 203036. or a 
complement thereof. 

The invention also features a nucleic acid molecule which includes a 
fragment of at least 150 (350. 400. 450, 500, 550. 600, 650. 700. and 761) 
nucleotides of the nucleotide sequence shown in SEQ ID NO:60, SEQ ID NO:62. 
15 or the nucleotide sequence of the cDNA of ATCC PTA-212. or a complement 
thereof. 

The invention also features a nucleic acid molecule which includes a 
fragment of at least 150 (350. 400. 450. 500. 550. 600, 650. 700. and 740) 
nucleotides of the nucleotide sequence shown in SEQ ID NO:48. SEQ ID NO:50. 
2 0 the cDNA of ATCC PTA-213. or a complement thereof. 

The invention also features a nucleic acid molecule which includes a 
fragment of at least 150 (350.400.450, 500.600. 700.800. 900. 1000, 1500. 
2000. 2500, 3000. 3500. 4000. 4500. 5000, and 5252) nucleotides of the 
nucleotide sequence shown in SEQ ID NO:5 1 . SEQ ID NO:53. or a complement 

2 5 thereof. 

The invention also features a nucleic acid molecule which includes a 
fragment of at least 150 ( 200. 300. 400. 500. 600. 700. 800. 900, 1000, 1400. 
1 80U. 2200. 2600. or 3000) nucleotides of the nucleotide sequence shown in SEQ 
ID NO:54. SEQ ID NO:56. the cDNA of ATCC PTA-213. or a complement 

3 0 thereof. 
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The invention features a nucleic acid molecule which includes a 
nucleotide sequence encoding a protein having an amino acid sequence that is at 
least 45% (or 55%. 65%, 75%, 85%, 95%, or 98%) identical to the amino acid 
sequence of SEQ ID NO:2, SEQ ID NO:8, SEQ ID NO:26. SEQ ID NO:39. SEQ 
5 ID NO:41, SEQ ID NO:43, SEQ ID NO:49. SEQ ID NO:52. SEQ ID NO:55, 
SEQ ID NO:6K or the amino acid sequence encoded by the cDNA of ATCC 
203037, the amino acid sequence encoded by the cDNA of ATCC 203035, the 
amino acid sequence encoded by the cDNA of ATCC 203036, the amino acid 
sequence encoded by the cDN A of ATCC PTA-2 1 1 . the amino acid sequence 

1 0 encoded by the cDNA of ATCC PTA-2 12. or the amino acid sequence encoded 
by the cDNA of ATCC PTA-2 13. 

In an embodiment, a CARD-3 nucleic acid molecule has the 
nucleotide sequence shown in SEQ ID NO:K or SEQ ID NO:3. or the nucleotide 
sequence of the cDNA of ATCC 203037. 

15 In another embodiment, a CARD-4L nucleic acid molecule has the 

nucleotide sequence shown in SEQ ID NO:7, or SEQ ID NO:9. or, the nucleotide 
sequence of the cDNA of ATCC 203035. 

In yet another embodiment, a CARD-4S nucleic acid molecule has the 
nucleotide sequence shown in SEQ ID NO:25. or SEQ ID NO:27. or the 

2 0 nucleotide sequence of the cDNA of ATCC 203036. In another embodiment, a 
murine CARD-4L nucleic acid molecule has the nucleotide sequence shown in 
SEQ IDNO:42. 

In another embodiment, a CARD-4Y nucleic acid molecule has the 
nucleotide sequence shown in SEQ ID NO:38. 

2 5 In another embodiment, a CARD-4Z nucleic acid molecule has the 

nucleotide sequence shown in SEQ ID NO:40, 

In another embodiment, a human CARD-5 nucleic acid molecule has 
the nucleotide sequence shown in SEQ ID NO:48. SEQ ID NO:50 or the 
nucleotide sequence of the cDNA of ATCC PTA-2 1 3. In another embodiment, a 

3 0 murine CARD-5 nucleic acid molecule has the nucleotide sequence shown in 
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SEQ ID NO:60 or SEQ ID NO:62. 

In yet another embodiment, a rat CARD-6 nucleic acid molecule has 
the nucleotide sequence shown in SEQ ID NO:5K SEQ ID NO:53, or the 
nucleotide sequence of the cDNA of ATCC PTA-21 1 . 
5 In still another embodiment, a human CARD-6 nucleic acid molecule 

has the nucleotide sequence shown in SEQ ID NO:54. SEQ ID NO:56, or the 
nucleotide sequence of the cDNA of ATCC PTA-21 3. 

Also within the invention is a nucleic acid molecule which encodes a 
fragment of a polypeptide having the amino acid sequence of SEQ ID NO:2, SEQ 

1 0 ID NO:8. SEQ ID NO:26. SEQ ID NO:39, SEQ ID NO:4 1 , SEQ ID NO:43, SEQ 
ID NO:49. SEQ ID NO:52, SEQ ID NO:55, SEQ ID NO:61, the fragment 
including at least 15 (25, 30, 50, 100, 150, 300. 400 or 540, 600, 700, 800, 900) 
contiguous amino acids of SEQ ID NO:2, SEQ ID NO:8, SEQ ID NO:26. SEQ 
ID NO:39. SEQ ID NO:4K SEQ ID NO:43, SEQ ID NO:49, SEQ ID NO:52, or 

15 SEQ ID NO:55. SEQ ID NO:61, the polypeptide encoded by the cDNA of ATCC 
Accession Number 203037, the polypeptide encoded by the cDNA of ATCC 
Accession Number 203035. the polypeptide encoded by the cDNA of ATCC 
Accession Number 203036. the polypeptide encoded by the cDNA of ATCC 
Accession Number PTA-21 1. the polypeptide encoded by the cDNA of ATCC 

2 0 Accession Number PTA-2 1 2. or the polypeptide encoded by the cDN A of ATCC 
Accession Number PTA-21 3, 

The invention includes a nucleic acid molecule which encodes a 
naturally occurring allelic variant of a polypeptide comprising the amino acid 
sequence of SEQ ID NO:2. SEQ ID NO:8. SEQ ID NO:26. SEQ ID NO:39. SEQ 

2 5 ID NO:41. SEQ ID NO:43. SEQ ID NO:49. SEQ ID NO:52. SEQ ID NO:55, 

SEQ ID NO:61 . or an amino acid sequence encoded by the cDNA of ATCC 
Accession Number 203037. 203035. 203036. PTA-21 1. PTA-212. or PTA-21 3. 
wherein the nucleic acid molecule hybridizes to a nucleic acid molecule 
consisting of SEQ ID NOT. SEQ ID NO:3. SEQ ID NO:7. SEQ ID NO:9. SEQ 

3 0 ID:25. SEQ ID NO:27. SEQ ID NO:38. SEQ ID NO:40. SEQ ID NO:42. SEQ ID 
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NO:48 ? SEQ ID NO:50, SEQ ID NO:5 1 , SEQ ID NO:53. SEQ ID NO:54, SEQ 
ID NO:56, SEQ ID NO:60, SEQ ID NO:62. the cDNA of ATCC 203037. the 
cDNA of ATCC 203035, the cDNA of ATCC 203036, the cDNA of ATCC PTA- 
211. the cDNA of ATCC PTA-2 1 2, or the cDN A of PT A-2 1 3 under stringent 
5 conditions. 

In general, an allelic variant of a gene will be readily identifiable as 
mapping to the same chromosomal location as said gene. For example, in 
Example 6. the chromosomal location of the human CARD-4 gene is discovered 
to be chromosome 7 close to the SHGC-31 928 genetic marker. Allelic variants 

10 of human CARD-4 will be readily identifiable as mapping to the human CARD-4 
locus on chromosome 7 near genetic marker SHGC-3 1928. - 

Also within the invention are: an isolated CARD-3 protein having an £ : 
amino acid sequence that is at least about 65%. preferably 75%. 85%. 95%. or v 
98% identical to the amino acid sequence of SEQ ID NO:2; an isolated CARD-3 

15 protein having an amino acid sequence that is at least about 85%. 95%. or 98% ^ 
identical to the kinase domain of SEQ ID NO:2 (e.g.. about amino acid residues 1 ~+ 
to 300 of SEQ ID NO:2; SEQ ID NO:4): and an isolated CARD-3 proteiiUiaving 
an amino acid sequence that is at least about 85%. 95%. or 98% identical to the 
linker domain of SEQ ID NO:2 (e.g.. about amino acid residues 301 to 43 1 of 

2 0 SEQ ID NO:2; SEQ ID NO:5); an isolated CARD-3 protein having an amino acid 
sequence that is at least about 85%. 95%. or 98% identical to the CARD domain 
of SEQ ID NO:2 (e.g.. about amino acid residues 432 to 540 of SEQ ID NO:2: 
SEQ ID NO:6); an isolated CARD-4 L protein having an amino acid sequence 
that is at least about 65%. preferably 75%. 85%. 95%. or 98% identical to the 

2 5 amino acid sequence of SEQ ID NO:8: an isolated CARD-4L protein having an 

amino acid sequence that is at least about 85%. 95%. or 98%) identical to the 
CARD domain of SEQ ID NO:8 (e.g.. about amino acid residues 15 to 1 14 of 
SEQ ID NO:8; SEQ ID NO: 10); an isolated CARD-4 L protein having an amino 
acid sequence that is at least about 85%. 95%. or 98% identical to the nucleotide 

3 0 binding domain of SEQ ID NO:8 (e.g.. about amino acid residues 198 to 397 of 
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SEQ ID NO:8; SEQ ID NO: 1 1 ; an isolated CARD-4L protein having an amino 
acid sequence that is at least about 85%, 95%. or 98% identical to the kinase la 
(P-loop) subdomain SEQ ID NO:8 (e.g.. about amino acid 127 to about amino 
acid 212 of SEQ ID NO:8: SEQ ID NO:46j: an isolated CARD-4L protein having 
5 an amino acid sequence that is at least about 85%, 95%, or 98% identical to the 
kinase 2 subdomain of SEQ ID NO:8 (e.g., about amino acid 273 to about amino 
acid 288 of SEQ ID NO:8; SEQ ID NO:47); an isolated CARD-4L protein having 
an amino acid sequence that is at least about 85%, 95%, or 98% identical to a 
kinase 3a subdomain of SEQ ID NO:8 (e.g., about amino acid residues 327 to 

10 338 of SEQ ID NO:8; SEQ ID NO: 14); an isolated CARD-4L protein having an 
amino acid sequence that is at least about 85%, 95%. or 98% identical to the 
Leucine-rich repeats of SEQ ID NO:8 (e.g., about amino acid residues 674 to 701 
of SEQ ID NO:8; SEQ ID NO:l 5; from amino acid 702 to amino acid 727 of 
SEQ ID NO:8; SEQ ID NO: 1 6: which extends from amino acid 728 to amino 

1 5 acid 754 SEQ ID NO:8; SEQ ID NO: 1 7: from amino acid 755 to amino acid 782 
of SEQ ID NO:8; SEQ ID NO: 18; from amino acid 783 to amino acid 810 of 
SEQ ID NO:8; SEQ ID NO: 19; from amino acid 81 1 to amino acid 838 of SEQ 
ID NO:8; SEQ ID NO:20 from amino acid 839 to amino acid 866 of SEQ ID 
NO:8: SEQ ID NO:21: from amino acid 867 to amino acid 894 of SEQ ID NO:8; 

2 0 SEQ ID NO:22; from amino acid 895 to amino acid 922 of SEQ ID NO:8; SEQ 
ID NO:23; and from amino acid 923 to amino acid 950 of SEQ ID NO:8; SEQ ID 
NO:24); an isolated CARD-4S protein having an amino acid sequence that is at 
least about 65%, preferably 75%. 85%. 95%, or 98% identical to the amino acid 
sequence of SEQ ID NO:26; an isolated CARD-4S protein having an amino acid 

2 5 sequence that is at least about 85%. 95%. or 98% identical to the CARD domain 
of SEQ ID NO:26 (e.g.. about amino acid residues 1 to 74 of SEQ ID NO:26; 
SEQ ID NO:28). Also within the invention arc: an isolated murine CARD-4L 
protein having an amino acid sequence that is ai least about 65%. preferably 75%. 
85%. 95%. or 98% identical to the amino acid sequence of SEQ ID NO:43. Also 

3C within the invention are: an isolated CARD-4Y protein having an amino acid 
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sequence that is at least about 65%, preferably 75%, 85%, 95% ? or 98% identical 
to the amino acid sequence of SEQ ID NO:39. Also within the invention are: an 
isolated CARD-4Z protein having an amino acid sequence that is at least about 
65%, preferably 75%, 85%, 95%, or 98% identical to the amino acid sequence of 
5 SEQ IDNO:41. 

Also within the invention are: an isolated CARD-5 protein having an 
amino acid sequence that is at least about 65%, preferably 75%, 85%. 95%, or 
l )8°«» identical to the amino acid sequence of SEQ ID NO:49 and an isolated 
CARD-5 protein comprising an amino acid sequence that is at least about 90%. 
10 4 )5 u o. or i)8% identical to SEQ ID NO:58 (CARD domain). 

Also within the invention are an isolated CARD-5 protein having an 
amino acid sequence that is at least about 65%. preferably 75%. 85%. 95%. or 
98% identical to the amino acid sequence of SEQ ID NO:60 and an isolated 
CARD-5 protein comprising an amino acid sequence that is at least about 90%. 
15 95%. or 98% identical to SEQ ID NO:57 (CARD domain). 

The invention also includes: an isolated CARD-6 protein having an 
amino acid sequence that is at least about 65%. preferably 75%. 85%. 95%. or 
98% identical to the amino acid sequence of SEQ ID NO:52 and an isolated 
CARD-6 protein having an amino acid sequence that is at least about 90%. 95%. 
2 0 or 98% identical to SEQ ID NO:59 (CARD domain). 

The invention also includes: an isolated CARD-6 protein having an 
amino acid sequence thai is at least about 65%. preferably 75%. 85%, 95%, or 
98% identical to the amino acid sequence of SEQ ID NO:55 and an isolated 
CARD-6 protein having an amino acid sequence that is at least about 90%, 95%. 

2 5 or 98% identical to SEQ ID NO:64 (CARD domain). . 

Also within the invention arc: an isolated CARD-3 protein which is 
encoded by a nucleic acid molecule having a nucleotide sequence that is at least 
about 65%. preferably 75%. 85%, or 95% identical to SEQ ID NO:3 or the cDNA 
of ATCC 203037; an isolated CARD-3 protein which .is encoded by a nucleic 

3 0 acid molecule having a nucleotide sequence at least about 65% preferably 75%. 
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85%. or 95% identical to the kinase domain encoding portion of SEQ ID NO:l 
(e.g., about nucleotides 213 to 1 1 13 of SEQ ID NO:l); an isolated CARD-3 
protein which is encoded by a nucleic acid molecule having a nucleotide 
sequence at least about 65% preferably 75%, 85%. or 95% identical the linker 
5 domain encoding portion of SEQ ID NO: 1 (e.g., about nucleotides 1 1 1 4 to 1 506 
of SEQ ID NO: 1 ): and an isolated CARD-3 protein which is encoded by a 
nucleic acid molecule having a nucleotide sequence at least about 65% preferably 
75%, 85%, or 95% identical the CARD domain encoding portion of SEQ ID 
NO: 1 (e.g., about nucleotides 1 507 to 1 833 of SEQ ID NO: 1 ); and an isolated 

1 0 CARD-3 protein which is encoded by a nucleic acid molecule having a 

nucleotide sequence which hybridizes under stringent hybridization conditions to 
a nucleic acid molecule having the nucleotide sequence of SEQ ID NO:3 or the 
non-coding strand of the cDNA of ATCC 203037. Also within the invention are: 
an isolated CARD-4Y protein which is encoded by a nucleic acid molecule 

15 having a nucleotide sequence that is at least about 65%. preferably 75%, 85%, or 
95% identical to SEQ ID NO:38. Also within the invention are nucleic acid 
molecules which include about nucleotides 2759 to 2842 of SEQ ID NO:7: about 
nucleotides 2843 to 2926 of SEQ ID NO:7: about nucleotides 2927 to 3010 of 
SEQ ID NO:7: about nucleotides 301 1 to 3094 of SEQ ID NO:7; and an isolated 

2 0 CARD-4L protein which is encoded by a nucleic acid molecule having a 

nucleotide sequence which hybridizes under stringent hybridization conditions to 
a nucleic acid molecule having the nucleotide sequence of SEQ ID NO:9. or the 
non-coding strand of the cDNA of ATCC 203035. 

Also within the invention are an isolated CARD-4S protein which is 

2 5 encoded by a nucleic acid molecule having a nucleotide sequence that is at least 

about 65%. preferably 75%. 85%. or 95% identical to SEQ ID NO:27: an isolated 
CARD-3 protein which is encoded by a nucleic acid molecule having a 
nucleotide sequence at least about 65% preferably 75%. 85%. or 95% identical 
the CARD domain encoding portion of SEQ ID NO:25 (e.g.. about nucleotides ] 

3 0 lo 222 of SEQ ID NO:25j: an isolated CARD-3 protein which is encoded by a 
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nucleic acid molecule having a nucleotide sequence at least about 65% preferably 
75%; 85%. or 95% identical the P-Loop encoding portion of SEQ ID NO:25 
(e.g.. about nucleotides 485 to 510 of SEQ ID NO:25). 

Also within the invention are an isolated CARD-5 protein which is 
5 encoded by a nucleic acid molecule having a nucleotide sequence that is at least 
about 65%, preferably 75%, 85%, or 95% identical to SEQ ID NO:48 or the 
cDNA of ATCC PTA-213; an isolated CARD-5 protein which is encoded by a 
nucleic acid molecule having a nucleotide sequence at least about 90% preferably 
95%. or 98% identical to the CARD encoding portion of SEQ ID NO:48 (e.g., 

10 about nucleotides 383 to 596 of SEQ ID NO:48): and an isolated CARD-5 
protein which is encoded by a nucleic acid molecule having a nucleotide 
sequence which hybridizes under stringent hybridization conditions to a nucleic 
acid molecule having the nucleotide sequence of SEQ ID NO:48 or the non- 
coding strand of the cDNA of ATCC PTA-213. 

15 Also within the invention are an isolated CARD-5 protein which is 

encoded by a nucleic acid molecule having a nucleotide sequence that is at least 
about 65%. preferably 75%. 85%, or 95% identical to SEQ ID NO:60; an isolated 
CARD-5 protein which is encoded by a nucleic acid molecule having a 
nucleotide sequence at least about 90% preferably 95%. or 98% identical to the 

2 0 CARD encoding portion of SEQ ID NO:60 (e.g.. about nucleotides 4 1 6 to 625 of 
SEQ ID NO:60); and an isolated CARD-5 protein which is encoded by a nucleic 
acid molecule having a nucleotide sequence which hybridizes under stringent 
hybridization conditions to a nucleic acid molecule having the nucleotide 
sequence of SEQ ID NO:60. 

2 5 Also within the invention are an isolated CARD-6 protein which is 

encoded by a nucleic acid molecule having a nucleotide sequence that is at least 
about 65%, preferably 75%. 85%. or 95% identical to SEQ ID NO:51 ; an isolated 
CARD-6 protein which is encoded by a nucleic acid molecule having a 
nucleotide sequence at least about 90% preferably 95%. or 98% identical to the 

3 0 CARD encoding portion of SEQ ID NO:51 (e.g.. about nucleotides 169 to 456 of 
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SEQ ID NO:51); and an isolated CARD-6 protein which is encoded by a nucleic 
acid molecule having a nucleotide sequence which hybridizes under stringent 
hybridization conditions to a nucleic acid molecule having the nucleotide 
sequence of SEQ IDNO:51. 
5 Also within the invention are an isolated CARD-6 protein which is 

encoded by a nucleic acid molecule having a nucleotide sequence that is at least 
about 65%. preferably 75%, 85%, or 95% identical to SEQ ID NO:54; an isolated 
CARD-6 protein which is encoded by a nucleic acid molecule having a 
nucleotide sequence at least about 90% preferably 95%, or 98% identical to the 
1 o CARD encoding portion of SEQ ID NO:54: and an isolated CARD-6 protein 
which is encoded by a nucleic acid molecule having a nucleotide sequence which 
hybridizes under stringent hybridization conditions to a nucleic acid molecule 
having the nucleotide sequence of SEQ ID MO:54. 

Another embodiment of the invention features CARD-3. CARD-4, 
1 5 CARD-5. or CARD-6 nucleic acid molecules which specifically detect CARD-3. 
CARD-4. CARD-5. or CARD-6 nucleic acid molecules, relative to nucleic acid 
molecules encoding other members of the CARD supcrfamily. For example, in 
one embodiment, a CARD-4L nucleic acid molecule hybridizes under stringent 
conditions to a nucleic acid molecule comprising the nucleotide sequence of SEQ 
20 ID NO:7. SEQ ID NO:9. or the cDNA of ATCC 203035. or a complement 

thereof. In another embodiment, the CARD-4L nucleic acid molecule is at least 
300 (350. 400. 450. 500. 550. 600. 650. 700. 800. 900. 1000. 1300. 1600. 1900. 
2100. 2400. 2700. 3000. or 3382) nucleotides in length and hybridizes under 
stringent conditions to a nucleic acid molecule comprising the nucleotide 

2 5 sequence shown in SEQ ID NO:7. SEQ ID NO:9. the cDNA of ATCC 203035. or 

a complement thereof. In another embodiment, an isolated CARD-4L nucleic 
acid molecule comprises nucleotides 287 to 586 of SEQ ID NO:7. encoding the 
CARD domain of CARD-4L. or a complement thereof. In yet another 
embodiment, the invention provides an isolated nucleic acid molecule which is 

3 0 aniisense to the coding strand of a CARD-4 L nucleic acid. 
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In another embodiment, a CARD-5 nucleic acid molecule hybridizes 
under stringent conditions to a nucleic acid molecule comprising the nucleotide 
sequence of SEQ ID NO:48. SEQ ID NO:50, or the cDNA of ATCC PTA-213. or 
a complement thereof. In another embodiment, the CARD-5 nucleic acid 
5 molecule is at least 300 (350, 400, 450, 500. 550, 585. 600. 650. 700. or 740) 
nucleotides in length and hybridizes under stringent conditions to a nucleic acid 
molecule comprising the nucleotide sequence shown in SEQ ID NO:48. SEQ ID 
NO:50. the cDNA of ATCC PTA-213, or a complement thereof. In another 
embodiment, an isolated CARD-5 nucleic acid molecule comprises nucleotides 

10 383 to 596 of SEQ ID NO:48. encoding the CARD of CARD-5. In yet another 
embodiment, the invention provides an isolated nucleic acid molecule which is 
antisense to the coding strand of a CARD-5 nucleic acid. 

Another aspect of the invention provides a vector, e.g.. a recombinant 
expression vector, comprising a CARD-3. CARD-4, CARD-5. or CARD-6 

1 5 nucleic acid molecule of the invention. In another embodiment the invention 
provides a host cell containing such a vector. The invention also provides a 
method for producing CARD-3. CARD-4. CARD-5. or CARD-6 protein by ; 
culturing. in a suitable medium, a host cell of the invention containing a 
recombinant expression vector such that a CARD-3. CARD-4. CARD-5. or 

2 0 CARD-6 protein is produced. 

Another aspect of this invention features isolated or recombinant 
CARD-3. CARD-4. CARD-5. or CARD-6 proteins and polypeptides. Preferred 
CARD-3. CARD-4. CARD-5. or CARD-6 proteins and polypeptides possess at 
least one biological activity possessed by naturally occurring human CARD-3. 

2 5 CARD-4. CARD-5. or CARD-6, e.g.. (1 ) the ability to form protein:protein 

interactions with proteins in the apoptotic signaling pathway; (2) the ability to 
form CARD-CARD interactions with proteins in the apoptotic signaling pathway; 
(3) the ability to bind a CARD-3. CARD-4. CARD-5. or CARD-6 ligand: and (4 ) 
the ability to bind to an intracellular target. Other activities include: ( 1 ) 

3 0 modulation of cellular proliferation: (2) modulation of cellular differentiation: (3 ) 
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modulation of cellular death; and (4) modulation of the NF-kB pathway. 

The CARD-3 , CARD-4, CARD-5, or CARD-6 proteins of the present 
invention, or biologically active portions thereof, can be operatively linked to a 
non-CARD-3, non-CARD-4, non-CARD-5 ? or non-CARD-6 polypeptide (e.g.. 
5 heterologous amino acid sequences) to form CARD-3. CARD-4, CARD-5, or 
CARD-6 fusion proteins, respectively. The invention further features antibodies 
that specifically bind CARD-3. CARD-4, CARD-5, or CARD-6 proteins, such as 
monoclonal or polyclonal antibodies. In addition, the CARD-3. CARD-4. 
CARD-5. or CARD-6 proteins or biologically active portions thereof can be 
1 o incorporated into pharmaceutical compositions, which optionally include 
pharmaceutical!} acceptable carriers. 

In another aspect, the present invention provides a method for 
detecting the presence of CARD-3, CARD-4, CARD-5. or CARD-6 activity or 
expression in a biological sample by contacting the biological sample with an 
15 agent capable of detecting an indicator of CARD-3. CARD-4, CARD-5, or 
CARD-6 activity such that the presence of CARD-3. CARD-4, CARD-5. or 
CARD-6 activity is detected in the biological sample. 

In another aspect, the invention provides a method for modulating 
CARD-3. CARD-4. CARD-5, or CARD-6 activity comprising contacting a cell 
2 0 with an agent that modulates (inhibits or stimulates) CARD-3. CARD-4. CARD- 
5. or CARD-6 activity or expression such that CARD-3. CARD-4. CARD-5. or 
CARD-6 activity or expression in the cell is modulated. In one embodiment, the 
agent is an antibody that specifically binds to CARD-3. CARD-4. CARD-5. or 
CARD-6 protein. In another embodiment, the agent modulates expression of 

2 5 CARD-3. CARD-4. CARD-5. or CARD-6 by modulating transcription of a 

CARD-3. CARD-4. CARD-5. or CARD-6 gene, splicing of a CARD-3. CARD- 

4. CARD-5. or CARD-6 mRNA. or translation of a CARD-3, CARD-4. CARD- 

5. or CARD-6 mRNA. In yet another embodiment, the agent is a nucleic acid 
molecule having a nucleotide sequence that is antisensc to the coding strand oi 

3 0 the CARD-3. CARD-4. CARD-5. or CARD-6 mRNA or the CARD-3. CARD-4. 
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CARD-5. or CARD-6 gene. 

In one embodiment, the methods of the present invention are used to 
treat a subject having a disorder characterized by aberrant CARD-3. CARD-4, 
CARD-5, or CARD-6 protein or nucleic acid expression or activity or related to 
5 CARD-3, CARD-4. CARD-5, or CARD-6 expression or activity by 

administering an agent which is a CARD-3, CARD-4. CARD-5. or CARD-6 
modulator to the subject. In one embodiment, the CARD-3, CARD-4, CARD-5, 
or CARD-6 modulator is a CARD-3. CARD-4, CARD-5. or CARD-6 protein. In 
another embodiment the CARD-3. CARD-4. CARD-5. or CARD-6 modulator is 

10 a CARD-3. CARD-4. CARD-5. or CARD-6 nucleic acid molecule. In other 
embodiments, the CARD-3, CARD-4. CARD-5. or CARD-6 modulator is a 
peptide, peptidomimetic. or other small molecule. 

The present invention also provides a diagnostic assay for identifying \ 
the presence or absence of a genetic lesion or mutation characterized by at least 

15 one of: (i) aberrant modification or mutation of a gene encoding a CARD-3. 

CARD-4, CARD-5, or CARD-6 protein; (ii) mis-regulation of a gene encoding a ;r 
CARD-3. CARD-4. CARD-5. or CARD-6 protein: (iii) aberrant RNA splicing: 
and ( iv) aberrant post-translational modification of a CARD-3. CARD-4. CARD- 
5. or CARD-6 protein, wherein a wild-type form of the gene encodes a protein 

2 0 with a CARD-3. CARD-4. CARD-5. or CARD-6 activity.. 

In another aspect, the invention provides a method for identify ing a 
compound that binds to or modulates the activity of a CARD-3. CARD-4. 
CARD-5. or CARD-6 protein. In general, such methods entail measuring a 
biological activity of a CARD-3. CARD-4. CARD-5. or CARD-6 protein in the 

2 5 presence and absence of a test compound and identifying those compounds which 

alter the activity of the CARD-3. CARD-4. CARD-5. or CARD-6 protein. 

The invention also features methods for identifying a compound 
which modulates the expression of CARD-3. CARD-4. CARD-5. or CARD-6 by 
measuring the expression of CARD-3. CARD-4. CARD-5. or CARD-6 in the 

3 0 presence and absence of a compound. 
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Other features and advantages of the invention will be apparent from 
the following detailed description and claims. 

Brief Description of the Drawings 
5 Figure 1 depicts the cDNA sequence (SEQ ID NO: 1 ) of human 

CARD-3. The open reading frame of CARD-3 (SEQ ID NO:l ) extends from 
nucleotide 213 to nucleotide 1833 nucleotide (SEQ ID NO:3). 

Figure 2 depicts the predicted amino acid sequence (SEQ ID NO:2) of 
human CARD-3. 

l o Figure 3 depicts the cDNA sequence (SEQ ID NO:7) of CARD-4L. 

The open reading frame of SEQ ID NO:7 extends from nucleotide 245 to 
nucleotide 3103 (SEQ ID NO:9). 

Figure 4 depicts the predicted amino acid sequence (SEQ ID NO:8) of 
human CARD-4L. 

1 s Figure 5 depicts the partial cDNA sequence (SEQ ID NO:25) of 

CARD-4S and the predicted amino acid sequence (SEQ ID NO:25) of human 
CARD-4S. The open reading frame of CARD-4 (SEQ ID NO:25) extends from 
nucleotide 1 to nucleotide 1470 (SEQ ID NO:27). 

Figure 6 depicts the predicted amino acid sequence (SEQ ID NO:26) 

2 0 of human CARD-4S. 

Figure 7 depicts an alignment of the CARD domains of CARD-4 
(SEQ ID NO: 10). CARD-3 (SEQ ID NO:6). ARC-CARD (SEQ ID NO:3 1 ). 
cIAPl-CARD (SEQ ID NO:32), and cIAP2-CARD (SEQ ID NO:33). 

Figure 8 is a plot showing predicted structural features of human 

2 5 CARD-4L 

Figure 9 is a plot showing predicted structural features of human 

CARD-4S. 

Figure 10 depicts the cDNA sequence (S0Q ID NO:38) of the human 
CARD-4Y splice variant clone. The predicted open reading frame of the human 

3 0 CARD-4Y splice variant clone extends from nucleotide 438 to nucleotide 1 1 84. 
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Figure 1 1 depicts the amino acid sequence (SEQ ID NO: 3 9) of the 
protein predicted to be encoded by the human CARD-4Y cDNA open reading 
frame. 

Figure 12 depicts the cDNA sequence (SEQ ID NO:40) of the human 
5 CARD-4Z splice variant clone. The predicted open reading frame of the human 
CARD-4Z splice variant clone extends from nucleotide 489 to nucleotide 980. 

Figure 13 depicts the amino acid sequence (SEQ ID NO:41) of the 
protein predicted to be encoded by the human CARD-4Z cDNA open reading 
frame. 

10 Figure 14 depicts an alignment of human CARD-4L (SEQ ID NO:8 ). 

the predicted amino acid sequence of human CARD-4Y (SEQ ID NO:39); and 
the predicted amino acid sequence of human CARD-4Z (SEQ ID NO:41 ). 

Figure 15 depicts the nucleotide sequence of the murine CARD-4L 
cDNA (SEQ ID NO:42). 
15 Figure 16 depicts the predicted amino acid sequence of murine 

CARD-4L (SEQ ID NO:43). 

Figure 17 depicts an alignment of human CARD-4L (SEQ ID NO:8) 
and the predicted amino acid sequence of murine CARD-4L (SEQ ID NO:43). 

Figure 1 8 depicts a 32042 nucleotide genomic sequence of CARD-4. 
2 0 Figure 1 9 depicts the nucleotide sequence of a murine CARD-5 

cDNA (SEQ ID NO:60). The open reading frame of this cDNA extends from 
nucleotide 89 to nucleotide 667 of SEQ ID NO:60 (SEQ ID NO:62) and encodes 
a 193 amino acid protein (SEQ ID NO:61 ). 

Figure 20 depicts a hydropathy plot of murine CARD-5. Relatively 

2 5 hydrophobic residues are above the dashed horizontal line, and relatively 

hydrophilic residues are below the dashed horizontal line. The cysteine residues 
(cys) and potential N-glycosylation sites (Ngly) are indicated by short vertical 
lines just below the hydropathy trace. 

Figure 21 depicts the nucleotide sequence of a human CARD-5 cDNA 

3 0 (SEQ ID NO:48). The open reading frame of this cDNA extends from nucleotide 



BN5DOCID <WO 0 1 0O926A2 I > 



WO 01/00826 



PCT/US00/17691 



53 to nucleotide 638 of SEQ ID NO:48 (SEQ ID NO:50) and encodes a 195 
amino acid protein SEQ ID NO:49). 

Figure 22 depicts a hydropathy plot of human CARD-5. Relatively 
hydrophobic residues are above the dashed horizontal line, and relatively 
5 hydrophilic residues are below the dashed horizontal line. The cysteine residues 
(cys) and potential N-glycosylation sites (Ngly) are indicated by short vertical 
lines just below the hydropathy trace. 

Figure 23 depicts an alignment of the cDNA sequences of murine 
CARD-5 (SEQ ID NO:60) and human CARD-5 (SEQ ID NO:48). This 
10 alignment was created using ALIGN (version 2.0; PAM120 scoring matrix; -12/- 
4 gap penalty). In this alignment the sequences are 68.2% identical. 

Figure 24 depicts an alignment of the amino acid sequences of murine 
CARD-5 (SEQ ID NO:61 ) and human CARD-5 (SEQ ID NO:49). This 
alignment was created using ALIGN (version 2.0; PAM120 scoring matrix; -12/- 
15 4 gap penalty). In this alignment the sequences are 7 3 .8% identical. 

Figure 25 depicts the nucleotide sequence of a rat CARD-6 cDNA 
(SEQ ID NO;51 ). The open reading frame of this cDNA extends from nucleotide 
169 to nucleotide 2883 of SEQ ID NO:51 (SEQ ID NO:53) and encodes a 505 
amino acid protein (SEQ ID NO:52). 
2 0 Figure 26 depicts,a hydropathy plot of rat CARD-6. Relatively 

hydrophobic residues are above the dashed horizontal line, and relatively 
hydrophilic residues arc below the dashed horizontal line. The cysteine residues 
(cys) and potential N-glycosyiaiion sites (Ngly) are indicated by short vertical 
lines just below the hydropathy trace. 

2 5 Figure 27 depicts an alignment of the CARD domains of murine 

CARD-5 (SEQ ID NO:57). human CARD-5 (SEQ ID NO:58). and RAIDD (SEQ 
IDNO:61). 

Figure 28 depicts the nucleotide sequence of a human CARD-6 cDNA 
(SEQ ID NO:54). The open reading frame of this cDNA extends from nucleotide 

3 0 200 to 3310 of SEQ ID NO:54 (SEQ ID NO:56) and encodes a 1037 amino acid 
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protein (SEQ ID NO:55). 

Figure 29 depicts a hydropathy plot of human CARD-6. Relatively 
hydrophobic residues are above the dashed horizontal line, and relatively 
hydrophilic residues are below the dashed horizontal line. 
5 Figure 30 depicts an alignment of the CARD domain of human 

CARD-6 (SEQ ID NO:64) with a consensus CARD domain (SEQ ID NO:67). In 
this depiction of the consensus sequence, more conserved residues are indicated 
by uppercase letters and less conserved residues are indicated by lowercase 
letters. 

1 o Figure 3 1 depicts an alignment of the CARD domains of human 

CARD-3. human CARD-4, human CARD-5. murine CARD-5. human CARD-6. 
and rat CARD-6. This alignment was created using the Clustal method with 
PAM250 residue weight tabic. A consensus sequence is also depicted (SEQ ID 
NO: ). 

15 

Detailed Description oTthe Invention 

The present invention is based, in part, on the discovery of cDNA - 
molecules encoding human CARD-3. human CARD-4. partial murine CARD-4L. 
murine CARD-5. human CARD-5. rat CARD-6. and human CARD-6 proteins. 

20 
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TABLE 1 

Summary of CARD-3, CARD-4, CARD-5, and CARD-6 Sequence Information. 



5 



Gene 


cDNA 


Protein 


ORF 


Figure 


Accession 
Number 


human 


SEQ ID 


SEQ ID 


SEQID 


Figs. 


203037 


CARD-3 


NO:l 


NO:2 


NO:3 


1-2 




human 


SEQ ID 


SEQ ID 


SEQ ID 


Figs. 


203035 


CARD-4L 


NO:7 


NO:8 


NO:9 


3-4 




human 


SEQ ID 


SEQ ID 


SEQ ID 


Figs. 


203036 


CARD-4S 


NO:25 


NO:26 


NO:27 


5-6 




human 


SEQID 


SEQ ID 




Figs. 




CARD-4Y 


NO:38 


NO:39 




km 1 




human 


SEQ ID 


SEQ ID 




Figs. 




CARD-4Z 


NO:40 


NO:41 




12-13 




murine 


SEQ ID 


SEQID 




Figs. 




CARD-4L 


NO:42 


NO:43 




15-16 




human 


SEQ ID 


SEQID 


SEQ ID 


Fig. 21 


PTA-213 


CARD-5 


NO:48 


NO:49 


NO:50 






murine 


SEQ ID 


SEQ ID 


SEQ ID 


Fig. 19 


PTA-2 1 2 


CARD-5 


NO:60 


NO:61 


NO:62 






human 


SEQ ID 


SEQ ID 


SEQ ID 


Fig. 28 


PTA-213 


CARD-6 


NO:54 


NO:55 


NO:56 






rat 


SEQ ID 


SEQ ID 


SEQ ID 


Fig. 25 


PTA-2! 1 


CARD-6 


NO:5l 


NO:52 


NO:53 







A nucleotide sequence encoding a human CARD-3 protein is shown 
in Figure 1 (SEQ ID NO: 1; SEQ ID NO:3 includes the open reading frame only). 
A predicted amino acid sequence of CARD-3 protein is also shown in Figure 2 
10 (SEQIDNO:2). 

CARD-4 has at leasi two forms, a long form. CARD-4L. and a short 
form. CARD-4S. as well as two or more splice variants. A nucleotide sequence 
encoding a human CARD-4L protein is shown in Figure 3 (SEQ ID NO:7; SEQ 
ID NO:9 includes the open reading frame only). A predicted amino acid 
15 sequence of CARD-4L protein is also shown in Figure 4 (SEQ ID NO:8). A 
nucleotide sequence encoding a human CARD-4S protein is shown in Figure 5 
(SEQ ID NO:25: SEQ II) NO:27 includes the open reading frame only). A 
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predicted amino acid sequence of CARD-4S protein is shown in Figure 6 (SEQ 
ID NO:26). Two additional splice variants of human CARD-4 are provided in 
Figures 10 and 1 1 (human CARD-4Y) and Figures 12 and 13 (human CARD-4Z) 
(predicted amino acid sequences: SEQ ID NO:39 and SEQ ID NO:41 and nucleic 
5 acid sequences: SEQ ID NO:38 and SEQ ID NO:40). These two splice variants 
arc predicted to contain 249 and 164 amino acids, respectively. An alignment of 
human CARD-4Y, human CARD-4Z and human CARD-4L is shown in Figure 
14. 

In addition to the human CARD-4 proteins, a full length nucleotide 

1 0 sequence of the murine ortholog of human CARD-4L is provided in Figure 1 5 

( SEQ ID NO:42). An alignment of murine CARD-4L with human CARD-4L is 
shown in Figure 1 7. 

A nucleotide sequence encoding a murine CARD-5 protein is shown 
in Figure 19 (SEQ ID NO:60; SEQ ID NO:62 includes the open reading frame 
15 only). A predicted amino acid sequence of murine CARD-5 protein is also 
shown in Figure 19 (SEQ ID NO:61). 

A nucleotide sequence encoding a human CARD-5 protein is shown 
in Figure 21 (SEQ ID NO:48; SEQ ID NO:50 includes the open reading frame 
only). A predicted amino acid sequence of human CARD-5 protein is also shown 
20 in Figure 21 (SEQ ID NO:49). 

A nucleotide sequence encoding a rat CARD-6 protein is shown in 
Figure 25 (SEQ ID NO:51; SEQ ID NO:53 includes the open reading frame 
only). A predicted amino acid sequence of rat CARD-6 protein is also shown in 
Figure 25 (SEQ ID NO:52). 

2 S The human CARD-3 cDN A of Figure 1 (SEQ ID NO: 1 ). which is 

approximately 1931 nucleotides long including untranslated regions, encodes a 
protein having a molecular weight of approximately 61 kDa (excluding post- 
translational modifications). 

A plasmid containing a cDNA encoding human CARD-3 (pXEl 7A) 

3 0 was deposited with the American Type Culture Collection (ATCC). Manasass. 
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VA on May 14 ? 1998. and assigned Accession Number 203037. This deposit will 
be maintained under the terms of the Budapest Treaty on the International 
Recognition of the Deposit of Microorganisms for the Purposes of Patent 
Procedure. This deposit was made merely as a convenience for those of skill in 
5 the art and is not an admission that a deposit is required under 35 U.S.C.§1 12. 

The human CARD-4L cDNA of Figure 3 (SEQ ID NO:7), which is 
approximately 3382 nucleotides long including untranslated regions, encodes a 
protein having a molecular weight of approximately 108 kDa (excluding post- 
translational modifications). 

10 A plasmid containing a cDNA encoding human CARD-4L (pC4Ll ) 

was deposited with the American Type Culture Collection (ATCC). Manasass. 
VA on July 7. 1998, and assigned Accession Number 203035. This deposit will 
be maintained under the terms of the Budapest Treaty on the Internationa] 
Recognition of the Deposit of Microorganisms for the Purposes of Patent 

15 Procedure. This deposit was made merely as a convenience for those of skill in 
the art and is not an admission that a deposit is required under 35 U.S.C\§1 12. 

The human CARD-4S cDNA of Figure 5 (SEQ ID NO:25). which is 
3082 nucleotides long including untranslated regions. 

A plasmid containing a cDNA encoding human CARD-4S (pDB33E) 

2 0 was deposited with the American Type Culture Collection (ATCC). Manasass. 
VA on May 14. 1998. and assigned Accession Number 203036. This deposit will 
be maintained under the terms of the Budapest Treaty on the International 
Recognition of the Deposit of Microorganisms for the Purposes of Patent 
Procedure. This deposit was made merely as a convenience for those of skill in 

2 5 the an and is not an admission that a deposit is required under 35 U.S.C\§1 12. 

The human CARD-5 cDNA of Figure 21 (SEQ ID NO:48). which is 
approximately 740 nucleotides long including untranslated regions, encodes a 
protein having a molecular weight of approximately 21 .6 kD. 

A plasmid containing a cDNA encoding human CARD-5 

3 0 (EpHC5) was deposited with the American Type Culture Collection (ATCC). 

-30- 
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Manasass, VA on June 11, 1999. and assigned Accession Number PTA-21 3. 
This deposit will be maintained under the terms of the Budapest Treaty on the 
International Recognition of the Deposit of Microorganisms for the Purposes of 
Patent Procedure. This deposit was made merely as a convenience for those of 
5 skill in the art and is not an admission that a deposit is required under 35 
U.S. C. §112. 

The murine CARD-5 cDNA of Figure 19 (SEQ ID NO:60). which is 
approximately 778 nucleotides long, including untranslated regions, encodes a 
protein having a molecular weight of approximately 21.5 kD. 

10 A plasmid containing a cDNA encoding murine CARD-5 (EpMC5) 

was deposited with the American Type Culture Collection ( ATCC). Manassas. 
VA on June 11.1 999. and assigned Accession Number PTA-212. This deposit 
will be maintained under the terms of the Budapest Treaty on the International 
Recognition of the Deposit of Microorganisms for the Purposes of Patent 

15 Procedure. This deposit was made merely as a convenience for those of skill in 
the art and is not an admission that a deposit is required under 35 U.S.C.§1 12. 

The human CARD-6 cDNA of Figure 28 (SEQ ID NO:54). which is 
approximately 4244 nucleotides long encodes a protein having a molecular 
weight of approximately 1 16.5 kD (excluding post-translational modifications). 

2 0 A plasmid containing a cDN A encoding an amino terminal portion of 

human CARD-6 (EpHC6e), a plasmid encoding a carboxy terminal portion of 
human CARD-6 (EpHC6c). and a plasmid containing cDNA encoding human 
CARD-6 (EpHC6) were deposited with the American Type Culture Collection 
(ATCC). Manasass. VA on June 1 1. 1999. and assigned Accession Number PTA- 

2 5 213. This deposit will be maintained under the terms of the Budapest Treaty on 

the International Recognition of the Deposit of Microorganisms for the Purposes 
of Patent Procedure. This deposit was made merely as a convenience for those of 
skill in the art and is not an admission that a deposit is required under 35 
U.S.C,§112. 

3 0 
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The rat CARD-6 cDNA of Figure 25 (SEQ ID NO:51 ), which is 
approximately 5252 nucleotides long including untranslated regions, encodes a 
protein having a molecular weight of approximately 100.7 kD. 

A plasmid containing a cDNA encoding rat CARD-6 (EpMR5) was 
5 deposited with the American Type Culture Collection (ATCC), Manassas, VA. 
on June 10. 1999. and assigned Accession Number PTA-21 1. This deposit will 
he maintained under the terms of the Budapest Treaty on the International 
Recognition of the Deposit of Microorganisms for the Purposes of Patent 
Procedure. This deposit was made merely as a convenience for those of skill in 
10 the an and is not an admission that a deposit is required under 35 U.S.C, 1 12. 

A region of human CARD-4L protein (SEQ ID NO:8). the CARD 
domain (SEQ ID NO: 10). bears some similarity to a CARD domain of CARD-3 
(SEQ ID NO:6). ARC-CARD (SEQ ID NO:31). cIAPl-CARD (SEQ ID NO:32), 
and cIAP2-CARD (SEQ ID NO:33). This comparison is depicted in Figure 7. 
15 A region, the CARD domain (SEQ ID NO:58). of human C ARD-5 

protein (SEQ ID NO:48) and a region, the CARD domain (SEQ ID NO:57), of 
murine CARD-5 protein (SEQ ID NO:61) bear some similarity to the CARD of 
RAIDD (SEQ ID NO:70). This comparison is depicted in Figure 27. 

Each of CARD-3, CARD-4. CARD-5. and CARD-6 are members of a 
2C family of molecules (the "CARD-3 family", the "CARD-4 family", the "CARD-5 
family", and the "CARD-6 family" respectively) having certain conserved 
structural and functional features. The term "family" when referring to the 
protein and nucleic acid molecules of the invention is intended to mean two or 
more proteins or nucleic acid molecules having a common structural domain and 

2 5 having sufficient amino acid or nucleotide sequence identity as defined herein. 

Such family members can be naturally occurring and can be from either the same 
or different species. For example, a family can contain a first protein of human 
origin and a homologue of that protein of murine origin, as well as a second, 
distinct protein of human origin and a murine homologue of that protein. 

3 0 Members of a family may also have common functional characteristics. 
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In one embodiment, a CARD-3, CARD-4, CARD-5, or CARD-6 
protein includes a CARD domain having at least about 65%, preferably at least 
about 75%, and more preferably about 85%, 95%. or 98% amino acid sequence 
identity to the CARD domain of SEQ ID NO:6 or the CARD domain of SEQ ID 
5 NO: 1 0 or the CARD domain of SEQ ID NO:28. the CARD domain of SEQ ID 
NO:57, the CARD domain of SEQ ID NO:58, the CARD domain of SEQ ID 
NO:59. or the CARD domain of SEQ ID NO:64. 

Preferred CARD-3. CARD-4, CARD-5. or CARD-6 polypeptides of 
the present invention have an amino acid sequence sufficiently identical lo the 
10 CARD domain amino acid sequence of SEQ ID NO:6. SEQ ID NO: 10. SEQ ID 
NO:57. SEQ ID NO:58, SEQ ID NO:59. and SEQ ID NO:64, respectively. 

The CARD-3 polypeptide also has an amino acid sequence 
sufficiently identical to the kinase domain sequence of SEQ ID NO:4, and an 
amino acid sequence that is sufficiently identical to the linker domain of SEQ ID 
15 NO: 5. The CARD-4L polypeptide has an amino acid sequence sufficiently 
identical to the nucleotide binding domain of SEQ ID NO: 1 1 . an amino acid 
sequence sufficiently identical to the Walker Box "A" of SEQ ID NO: 1 2 or 
Walker Box " B M of SEQ ID NO: 13. an amino acid sequence sufficiently identical 
to the kinase 1 a subdomain of SEQ ID NO:46. an amino acid sequence 
2 0 sufficiently identical to the kinase 2 subdomain of SEQ ID NO:47. or an amino 
acid sequence sufficiently identical to the kinase 3a subdomain of SEQ ID 
NO: 14. or an amino acid sequence sufficiently identical to the Leucine-rich 
repeats of SEQ ID NO:15. SEQ ID NO:16. SEQ ID NO:17. SEQ ID NO:18. SEQ 
ID NO:19, SEQ ID NO:20. SEQ ID NO:21. SEQ ID NO:22. SEQ ID NO:23. and 

2 5 SKQlDNO:24. 

As used herein, the term "sufficiently identical" refers to a first amino 
acid or nucleotide sequence which contains a sufficient or minimum number of 
identical or equivalent (e.g.. an amino acid residue which has a similar side 
chain) amino acid residues or nucleotides to a second amino acid or nucleotide 

3 C sequence such that the first and second amino acid or nucleotide sequences have 
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a common structural domain and/or common functional activity. For example, 
amino acid or nucleotide sequences which contain a common structural domain 
having about 65% identity, preferably 75% identity, more preferably 85%, 95%, 
or 98% identity are defined herein as sufficiently identical. 
5 As used interchangeably herein a "CARD-3, CARD-4, CARD-5, or 

CARD-6 activity", "biological activity of CARD-3, CARD-4, CARD-5. or 
CARD-6" or "functional activity of CARD-3, CARD-4, CARD-5, or CARD-6", 
refers 10 an activity exerted by a CARD-3. CARD-4, CARD-5. or CARD-6 
protein, polypeptide or nucleic acid molecule on a CARD-3. CARD-4. CARD-5. 
10 or CARD-6 responsive cell as determined in vivo, or in vitro, according to 

standard techniques. A CARD-3, CARD-4, CARD-5, or CARD-6 activity can be 
a direct activity, such as an association with or an enzymatic activity on a second 
protein or an indirect activity, such as a cellular signaling activity mediated by 
interaction of the CARD-3, CARD-4, CARD-5, or CARD-6 protein with a 
15 second protein. In one embodiment, a CARD-3. CARD-4, CARD-5. or CARD-6 
activity includes at least one or more of the following activities: (i) the ability to 
interact with proteins in an apoptotic signaling pathway; (ii) the ability to interact 
with a CARD-3, CARD-4, CARD-5. or CARD-6 ligand; or (iii) the ability to 
interact with an intracellular target protein: (iv) the ability to interact, directly or 
2 0 indirectly with one or more with caspases; (v) the ability to modulate the activity 
of a caspasc, e.g.. caspase-9: (vi) the ability to modulate the activity of NF-kB: 
(vii) the ability to modulate Apaf-1 : (viii) the ability to modulate a Bcl-2 family 
member; (ix) the ability to modulate a neurotropin receptor, e.g.. P75 NIK ; (x) the 
ability to modulate the activity of a stress activated kinase (e.g.. JNK/p38): and 

2 5 (xi) the ability to modulate phosphorylation of CHOP (GADD 153). For 

example, in Example 4. CARD-3-containing proteins were shown to associate 
with CARD-4-containing proteins. In example 9. CARD-4 proteins were shown 
to induce NF-KB-mediated transcription. In example 10. CARD-3 and CARD-4 
were shown to enhance caspase-9 activity. 

3 0 
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CARD-4 and CARD-6 have Apaf-1 -like sequences and may bind to 
one or more members of the Bcl-2 family (e.g., Bcl-2. Boo, or Diva). CARD-3 
and CARD-5 may also bind to one or more members fo the Bcl-2 family. 
CARD-3, CARD-4. CARD-5. and CARD-6 may modulate apoplosis by 
5 influencing the activity of a Bcl-2 family member, which modulation, in turn, 
modulates activity of Apaf-1 or other factors. CARD-3, CARD-4, CARD-5. and 
CARD-6 nucleic acid and polypetpides as well as modulators of activity of 
expression of CARD-3, CARD-4. CARD-5. or CARD-6 can be used to modulate 
an Apaf-1 signaling pathway. 

10 CARD-3 and CARD-4 may bind to a neurotrophin receptor (e.g., 

p75 NTK ). CARD-3 and CARD-4 may modulate the activity of a neurotrophin 
receptor and thus modulate apoptosis of neuronal cells. Accordingly. CARD-3 
and CARD-4 nucleic acids and polypeptides as well as modulators of CARD-3 or 
CARD-4 activity or expression can be used to modulate apoptosis of neurons 

15 (e.g.. for treatment of neurological disorders, particularly neurodegenerative 
disorders). 

Accordingly, another embodiment of the invention features isolated.- 
CARD-3. CARD-4. CARD-5. or CARD-6 proteins and polypeptides having a 
CARD-3. CARD-4. CARD-5. or CARD-6 activity. 
20 Various aspects of the invention arc described in further detail in the 

following subsections. 

I. Isolated Nucleic Acid Molecules 

One aspect of the invention pertains to isolated nucleic acid'molecules 

2 5 that encode CARD-3. CARD-4, CARD-5. or CARD-6 proteins or biologically 

active portions thereof, as well as nucleic acid molecules sufficient for use as 
hybridization probes to identify CARD-3. CARD-4. CARD-5. or CARD-6- 
encoding nucleic acids (e.g.. CARD-3. CARD-4. CARD-5. or CARD-6 mRNA ) 
and fragments for use as PCR primers for the amplification or mutation of 

3 0 CARD-3. CARD-4. CARD-5. or CARD-6 nucleic acid molecules. As used 
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herein, the term "nucleic acid molecule" is intended to include DNA molecules 
(e.g., cDNA or genomic DNA) and RNA molecules (e.g., mRN A) and analogs of 
the DNA or RNA generated using nucleotide analogs. The nucleic acid molecule 
can be single-stranded or double-stranded, but preferably is double-stranded 
5 DNA. 

An "isolated" nucleic acid molecule is one which is separated from 
other nucleic acid molecules which are present in the natural source of the nucleic 
acid. Preferably, an "isolated" nucleic acid is free of sequences (preferably 
protein encoding sequences) which naturally flank the nucleic acid (i.e.. 

10 sequences located at the 5' and 3' ends of the nucleic acid ) in the genomic DNA 
of the organism from which the nucleic acid is derived. For example, in various 
embodiments, the isolated CARD-3. CARD-4, CARD-5. or CARD-6 nucleic acid 
molecule can contain less than about 5 kb. 4 kb, 3 kb. 2 kb. 1 kb. 0.5 kb or 0. 1 kb 
of nucleotide sequences which naturally flank the nucleic acid molecule in 

15 genomic DNA of the cell from which the nucleic acid is derived. Moreover, an 
"isolated" nucleic acid molecule, such as a cDNA molecule, can be substantially 
free of other cellular material, or culture medium when produced by recombinant 
techniques, or substantially free of chemical precursors or other chemicals when 
chemically synthesized. 

2 0 A nucleic acid molecule of the present invention, e.g.. a nucleic acid 

molecule having the nucleotide sequence of SEQ ID NO: 1 . SEQ ID NO:3. SEQ 
ID NO:7, SEQ ID NO:9. SEQ ID:25. SEQ ID NO:27. SEQ ID NO:38. SEQ ID 
NO:40. SEQ ID NO:42. SEQ ID NO:48. SEQ ID NO:50. SEQ ID NO:51. SEQ 
ID NO:53. SEQ ID NO:54. SEQ ID NO:56. SEQ ID NO:60. SEQ ID NO:62. the 

2 5 cDNA of ATCC 203037. the cDNA of ATCC 203035. the cDNA of ATCC 

203036. the cDNA of ATCC PTA-21 1. the cDNA of ATCC PTA-21 2. or the 
cDNA of ATCC PTA-21 3. or a complement of any of these nucleotide 
sequences, can be isolated using standard molecular biology techniques and the 
sequence information provided herein. Using all or portion of the nucleic acid 

3 0 sequences of SEQ ID NO: 1 . SEQ ID NO:3. SEQ ID NO:7. SEQ ID NO:9. SEQ 

- 30 - 



BMSOOCID <WO 0100626A2 I » 



WO 01/00826 



PCT/US00/17691 



ID:25, SEQ ID NO:27, SEQ ID NO:38. SEQ ID NO:40, SEQ ID NO:42, SEQ ID 
NO:48. SEQ ID NO:50, SEQ ID NO:5L SEQ ID NO:53. SEQ ID NO:54. SEQ 
ID NO:56. SEQ ID NO:60, SEQ ID NO:62, the cDNA of ATCC 203037 the 
cDNA of ATCC 203035, the cDNA of ATCC 203036, the cDNA of ATCC PTA- 
5 21 1. the cDNA of ATCC PTA-212, or the cDNA of PTA-213. as a hybridization 
probe, CARD-3. CARD-4, CARD-5, or CARD-6 nucleic acid molecules can be 
isolated using standard hybridization and cloning techniques (e.g.. as described in 
Sambrook et aL eds.. Molecular Cloning: A Laboratory Manual. 2nd. ed.. Cold 
Spring Harbor Laboratory, Cold Spring Harbor Laboratory' Press. Cold Spring 

10 Harbor. NY, 1989). 

A nucleic acid of the invention can be amplified using cDNA. mRNA 
or genomic DNA as a template and appropriate oligonucleotide primers 
according to standard PCR amplification techniques. The nucleic acid so 
amplified can be cloned into an appropriate vector and characterized by DNA 

15 sequence analysis. Furthermore, oligonucleotides corresponding to CARD-3; 
CARD-4. CARD-5, or CARD-6 nucleotide sequences can be prepared by 
- standard synthetic techniques, e.g., using an automated DNA synthesizer. „ ; ~, 
In another embodiment, an isolated nucleic acid molecule of the 
invention comprises a nucleic acid molecule which is a complement of the 

2 0 nucleotide sequence shown in SEQ ID NO: 1 . SEQ ID NO:3. SEQ ID NO:7. SEQ 
ID NO:9. SEQ ID:25. SEQ ID NO:27, SEQ ID NO:38. SEQ ID NO:40. SEQ ID 
NO:42. SEQ ID NO:48. SEQ ID NO:50. SEQ ID NO:5L SEQ ID NO:53. SEQ 
ID NO:54. SEQ ID NO:56, SEQ ID NO:60, SEQ ID NO:62. the cDNA of ATCC 
203037. the cDNA of ATCC 203035. or the cDNA of ATCC 203036. or the 

2 5 cDNA of ATCC PTA-21 1. the cDNA of PTA-212. or the cDNA of PTA-213. or 

a portion thereof. A nucleic acid molecule which is complementary to a given 
nucleotide sequence is one which is sufficiently complementary to the given 
nucleotide sequence that it can hybridize to the given nucleotide sequence 
thereby forming a stable duplex. 

3 0 Moreover, the nucleic acid molecule of the invention can comprise 
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only a portion of a nucleic acid sequence encoding CARD-3, CARD-4, CARD-5, 
or CARD-6, for example, a fragment which can be used as a probe or primer or a 
fragment encoding a biologically active portion of CARD-3, CARD-4, CARD-5, 
or CARD-6. The nucleotide sequence determined from the cloning of the human 
5 CARD-3, CARD-4, CARD-5, or CARD-6, and the partial murine CARD-4 gene 
allows for the generation of probes and primers designed for use in identifying 
and/or cloning CARD-3. CARD-4, CARD-5. or CARD-6 homologues in other 
cell types, e.g.. from other tissues, as well as CARD-3. CARD-4. CARD-5. or 
CARD-6 homologues and orthologs from other mammals. The probe/primer 

10 typically comprises substantially purified oligonucleotide. The oligonucleotide 
typically comprises a region of nucleotide sequence that hybridizes under 
stringent conditions to at least about 12. preferably about 25. more preferably 
about 50. 75, 100, 125. 150. 175, 200. 250. 300. 350 or 400 consecutive 
nucleotides of the sense or anti-sense sequence of SEQ ID NO: 1 , SEQ ID NO:3. 

1 5 SEQ ID NO:7, SEQ ID NO:9, SEQ ID:25. SEQ ID NO:27, SEQ ID NO:38, SEQ 
ID NO:40, SEQ ID NO:42. SEQ ID NO:48. SEQ ID NO:50. SEQ ID NO:5 1 , 
SEQ ID NO:53. SEQ ID NO:54, SEQ ID NO:56. SEQ ID NO:60. SEQ ID 
NO:62. the cDNA of ATCC 203037. the cDNA of ATCC 203035. or the cDNA 
of ATCC 203036, or the cDNA of ATCC PTA-21 1. the cDNA of PTA-212. or 

2 0 the cDNA of PTA-21 3. or of a naturally occurring mutant of one of SEQ ID 

NO:l. SEQ ID NO:3. SEQ ID NO:7. SEQ ID NO:9. SEQ ID:25, SEQ ID NO:27. 
SEQ ID NO:38. SEQ ID NO:40. SEQ ID NO:42. SEQ ID NO:48. SEQ ID 
NO:50. SEQ ID NO:51. SEQ ID NO:53. SEQ ID NO:54. SEQ ID NO:56. SEQ 
ID NO:60. SEQ ID NO:62. the cDNA of ATCC 203037. the cDNA of ATCC 
25 203035. the cDNA of ATCC 203036. or the cDNA of ATCC PTA-21 1. the 
cDNA of PTA-212. or the cDNA of PTA-21 3. 

Probes based on the CARD-3. CARD-4. CARD-5. or CARD-6 
nucleotide sequence can be used to detect transcripts or genomic sequences 
encoding the same or similar proteins. The probe comprises a label group 

3 0 attached thereto, e.g.. a radioisotope, a fluorescent compound, an enzyme, or an 
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enzyme co-factor. Such probes can be used as a part of a diagnostic test kit for 
identifying allelic variants and orthologs of the CARD-3, CARD-4, CARD-5. or 
CARD-6 proteins of the present invention, identifying cells or tissue which mis- 
express a CARD-3, CARD-4, CARD-5, or CARD-6 protein, such as by 
5 measuring a level of a CARD-3, CARD-4, CARD-5, or CARD-6-encoding 

nucleic acid in a sample of cells from a subject, e.g., detecting CARD-3, CARD- 
4. CARD-5. or CARD-6 mRNA levels or determining whether a genomic 
CARD-3. CARD-4. CARD-5, or CARD-6 gene has been mutated or deleted. 

A nucleic acid fragment encoding a "biologically active portion" of 

1 0 CARD-3. CARD-4. CARD-5. or CARD-6 can be prepared by isolating a portion 

of SEQ ID NO:l. SEQ ID NO:3. SEQ ID NO:7. SEQ ID NO:9. SEQ ID:25. SEQ 
ID NO:27, SEQ ID NO:3S. SEQ ID NO:40. SEQ ID NO:42. SEQ ID NO:48. 
SEQ ID NO:50. SEQ ID NO:5L SEQ ID NO:53. SEQ ID NO:54, SEQ ID 
NO:56. SEQ ID NO:60. SEQ ID NO:62. the cDNA of ATCC 203037. the cDNA 
15 of ATCC 203035, the cDNA of ATCC 203036. or the cDNA of ATCC PTA-21 I x 
the cDNA of PTA-21 2. or the cDNA of PTA-21 3, which encodes a polypeptide 
having a CARD-3. CARD-4. CARD-5. or CARD-6 biological activity. ~ : - 
expressing the encoded portion of CARD-3. CARD-4. CARD-5. or CARD-6 
protein (e.g.. by recombinant expression in vitro) and assessing the activity of the 

2 0 encoded portion of CARD-3, CARD-4. CARD-5, or CARD-6. For example, a 

nucleic acid fragment encoding a biologically active portion of CARD-3. CARD- 
4. CARD-5. . or CARD-6 includes a CARD domain, e.g.. SEQ ID NO:6. SEQ ID 
NO: 1 0. SEQ ID NO:28. SEQ ID NO:57. SEQ ID NO:58. SEQ ID NO:59. or 
SEQIDNO:62. 

2 5 The invention further encompasses nucleic acid molecules thai differ 

from the nucleotide sequence of SEQ ID NOT. SEQ ID NO:3. SEQ ID NO:7. 
SEQ ID NO:9. SEQ ID;25. SEQ ID NO:27. SEQ ID NO:38. SEQ ID NO:40. 
SEQ ID NO:42. SEQ ID NO:48. SEQ ID NO:50. SEQ ID NO:5L SEQ ID 
NO:53. SEQ ID NO:54, SEQ ID NO:56. SEQ ID NO:60. SEQ ID NO:62. the 

3 0 cDNA of ATCC 203037. the cDNA of ATCC 203035. the cDNA of ATCC 
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203036, or the cDNA of ATCC PTA-2 1 1 , the cDNA of PTA-2 12 ? or the cDNA 
of PTA-2 13, due to degeneracy of the genetic code and thus encode the same 
CARD-3, CARD-4, CARD-5, or CARD-6 protein as that encoded by the 
nucleotide sequence shown in SEQ ID NOT, SEQ ID NO:3. SEQ ID NO:7, SEQ 

5 ID NO:9. SEQ 1D:25, SEQ ID NO:27, SEQ ID NO:38, SEQ ID NO:40. SEQ ID 
NO:42, SEQ ID NO:48, SEQ ID NO:50, SEQ ID NO:51. SEQ ID NO:53. SEQ 
ID NO:54. SEQ ID NO:56. SEQ ID NO:60, SEQ ID NO:62, the cDNA of ATCC 

203037, the cDNA of ATCC 203035, the cDNA of ATCC 203036. or the cDNA 
of ATCC PTA-2 1 1. the cDNA of ATCC PTA-2 12. or the cDNA of ATCC PTA- 

10 213. 

In addition to the CARD-3, CARD-4, CARD-5. or CARD-6 
nucleotide sequence shown in SEQ ID NOT. SEQ ID NO:3. SEQ ID NO:7. SEQ 
ID NO:9, SEQ ID:25. SEQ ID NO:27. SEQ ID NO:38, SEQ ID NO:40. SEQ ID 
NO:42. SEQ ID NO:48, SEQ ID NO:50, SEQ ID NO:51. SEQ ID NO:53, SEQ 

15 ID NO:54, SEQ ID NO:56, SEQ ID NO:60, SEQ ID NO:62, the cDNA of ATCC 
203037. the cDNA of ATCC 203035, the cDNA of ATCC 203036. the cDNA of 
ATCC PTA-2 1 L the cDNA of ATCC PTA-2 12, or the cDNA of ATCC PTA- 
213, it will be appreciated by those skilled in the art thai DNA sequence 
polymorphisms that lead to changes in the amino acid sequences of CARD-3, 

2 0 CARD-4. CARD-5, or CARD-6 may exist within a population (e.g.. the human 
population). Such genetic polymorphism in the CARD-3, CARD-4. CARD-5. or 
CARD-6 gene may exist among individuals within a population due to natural 
allelic variation. As used herein, the terms "gene" and "recombinant gene" refer 
ro nucleic acid molecules comprising an open reading frame encoding a CARD- 

2 5 3. CARD-4. CARD-5. or CARD-6 protein, preferably a mammalian CARD-3. 

CARD-4, CARD-5. or CARD-6 protein. Such natural allelic variations can 
typically result in 1-5% variance in the nucleotide sequence of the CARD-3. 
CARD-4. CARD-5. or CARD-6 gene. Any and all such nucleotide variations 
and resulting amino acid polymorphisms in CARD-3. CARD-4. CARD-5. or 

3 C CARD-6 that are the result of natural allelic variation and that do not alter the 
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functional activity of CARD-3, CARD-4, CARD-5, or CARD-6 are intended to 
be within the scope of the invention. Thus, e.g., 1%, 2%, 3%, 4%, or 5% of the 
amino acids in CARD-3. CARD-4, CARD-5, or CARD-6 are replaced by another 
amino acid, preferably the amino acids are replaced by conservative substitutions. 
5 Moreover, nucleic acid molecules encoding CARD-3, CARD-4, 

CARD-5. or CARD-6 proteins from other species (CARD-3. CARD-4. CARD-5. 
or CARD-6 orthologs/homologues), which have a nucleotide sequence which 
differs from that of a CARD-3, CARD-4. CARD-5, or CARD-6 disclosed herein, 
are intended to be within the scope of the invention. 

1 o For example. Example 5 describes the murine CARD-4 ortholog and 

Example 14 describes the murine CARD-5 ortholog. Nucleic acid molecules 
corresponding to natural allelic variants and homologues of the CARD-3, CARD- 
4. CARD-5. or CARD-6 cDNA of the invention can be isolated based on their 
similarity to the nucleic acids disclosed herein using the human or murine 
1 5 cDN As. or a portion thereof, as a hybridization probe according to standard - 
hybridization techniques under stringent hybridization conditions. 

In general, an allelic variant of a gene will be readily identifiable as- 
mapping to the same chromosomal location as said gene. For example, in 
Example 6, the chromosomal location of the human CARD-4 gene is discovered 

2 0 to be chromosome 7 close to the SHGC-3 1 928 genetic marker. Allelic variants 

of human CARD-4 will be readily identifiable as mapping to the human CARD-4 
locus on chromosome 7 near genetic marker SHGC-3 1928. 

Accordingly, in another embodiment, an isolated nucleic acid 
molecule of the invention is at least 300 (325. 350. 375. 400. 425. 450. 500. 550. 

2 5 600. 650. 700. 800. 900. 1 000. 1 300. 1 600 or 1 93 1 ) nucleotides in length and 

hybridizes under stringent conditions to the nucleic acid molecule comprising the 
nucleotide sequence, preferably the coding sequence, of SEQ ID NO: 1 . SEQ ID 
NO:3. or the cDNA of ATCC 203037. In yet another embodiment, an isolated 
nucleic acid molecule of the invention, is at least 300 (325. 350. 375. 400. 425. 

3 0 450. 500. 550. 600. 650. 700. 800. 900. 1 000. or 1 300. 1 640. 1 900. 2200. 2500. 
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2800, 3100, or 3382) nucleotides in length and hybridizes under stringent 
conditions to the nucleic acid molecule comprising the nucleotide sequence, 
preferably the coding sequence, of SEQ ID NO:7, SEQ ID NO:9, SEQ ID NO:25, 
SEQ ID NO:27, SEQ ID NO:38, SEQ ID NO:40, the cDNA of ATCC 203035, or 
5 the cDNA of ATCC 203036. Accordingly, in another embodiment, an isolated 
nucleic acid molecule of die invention is at least 300 (325, 350, 375, 400, 425, 
450, 500, 550, 600, 650. 700, 800, 900, 1000, or 1300, 1640. 1900, 2200, 2500, 
2800, 3100, 3300, 3600. 3900. 4200 or 4209) nucleotides in length and 
hybridizes under stringent conditions to the nucleic acid molecule comprising the 

1 0 nucleotide sequence, preferably the coding sequence, of SEQ ID NO:42. 

In yet another embodiment, an isolated nucleic acid molecule of the 
invention is at least 300 (350. 400. 450. 500. 550. 600. 650, 700. or 740) 
nucleotides in length and hybridizes under stringent conditions to a nucleic acid 
molecule consisting of the nucleotide sequence of SEQ ID NO:48 or SEQ ID 

15 NO:50. 

In yet another embodiment, an isolated nucleic acid molecule of the 
invention is at least 300 (350. 400. 450. 500. 550. 600. 650. 700. or 761 ) 
nucleotides in length and hybridizes under stringent conditions to a nucleic acid 
molecule consisting of the nucleotide sequence of SEQ ID NO:60 or SEQ ID 
2 0 NO:62. 

In yet another embodiment, an isolated nucleic acid molecule of the 
invention is at least 300 (350. 400, 450. 500, 550. 600. 650. 700. 1 000. 1 500. 
2000. 2500. 3000. 3500. 4000. 4500, 5000. 5200. or 5252) nucleotides in length 
and hybridizes under stringent conditions to a nucleic acid molecule consisting of 
2 5 the nucleotide sequence of SEQ JD NO:51 or SEQ ID NO:53. 

In yet another embodiment, an isolated nucleic acid molecule of the 
invention is at least 300 (350. 400. 450. 500. 550. 600. 650. 700. 1 000. 1 500. 
2000. 2500. 3000. 3500. 4000. 4500. or 5000) nucleotides in length and 
hybridizes under stringent conditions to a nucleic acid molecule consisting of the 
3C nucleotide sequence of SEQ ID NO:54 or SEQ ID NO:56. 
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As used herein, the term "hybridizes under stringent conditions" is 
intended to describe conditions for hybridization and washing under which 
nucleotide sequences at least .60% (65%, 70%, preferably 75%) identical to each 
other typically remain hybridized to each other. Such stringent conditions are 
5 known to those skilled in the art and can be found in Current Protocols in 
N4olecular Biology, John Wiley & Sons. N.Y. (1989). 6.3.1-6.3.6. An. 
non-limiting example of stringent hybridization conditions are hybridization in 
6X sodium chloride/sodium citrate (SSC) at about 45 DC. followed by one or 
more washes in 0.2 X SSC, 0.1% SDS at 50-65 DC (e.g., 50OC or 60 UC or 

10 65 r C). Preferably, the isolated nucleic acid molecule of the invention that 

hybridizes under stringent conditions corresponds to a naturally-occurring nucleic 
acid molecule. As used herein, a "naturally-occurring" nucleic acid molecule 
refers to an RNA or DNA molecule having a nucleotide sequence that occurs in 
nature (e.g.. encodes a natural protein). 

15 In addition to naturally-occurring allelic variants of the CARD-3. 

CARD-4. CARD-5, or CARD-6 sequence that may exist in the population, the 
skilled artisan will further appreciate that changes can be introduced by mutation.*, 
into the nucleotide sequence of SEQ ID NO:l. SEQ ID NO:3. SEQ ID NO:7, 
SEQ ID NO:9. SEQ ID:25, SEQ ID NO:27. SEQ ID NO:38. SEQ ID NO:40. 

2 0 SEQ ID NO:42. SEQ ID NO:48, SEQ ID NO:50. SEQ ID NO:5 1 . SEQ ID 
NO:53. SEQ ID NO:54. SEQ ID NO:56. SEQ ID NO:60. SEQ ID NO:62. the 
cDNA of ATCC 203037. the cDNA of ATCC 203035. the cDNA of ATCC 
203036. the cDNA of ATCC PTA-21 1. the cDNA of ATCC PTA-2.12. or the 
cDNA of ATCC PTA-21 3. thereby leading to changes in the amino acid 

2 5 sequence of the encoded protein without altering the functional ability of the 

protein. For example, one can make nucleotide substitutions leading to amino 
acid substitutions at "non-essential" amino acid residues. A "non-essential" 
ammo acid residue is a residue that can be altered from the wild-type sequence of 
CARD-3, CARD-4 L/S. CARD-4 splice variant, murine CARD-4 protein, human 

3 0 CARD-5 protein, murine CARD-5 protein, or rat CARD-6 protein without 
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altering the biological activity, whereas an "essential" amino acid residue is 
required for biological activity. For example, amino acid residues that are 
conserved among the CARD-3, CARD-4L/S, CARD-4 splice variant, CARD-4, 
CARD-5, or CARD-6 proteins of various species are predicted to be particularly 
5 unamenable to alteration. 

For example, preferred CARD-3. CARD-4. CARD-5. and CARD-6 
proteins of the present invention, contain at least one CARD domain. 
Additionally, a CARD-3 protein also contains at least one kinase domain or at 
least one linker domain. A CARD domain contains at least one nucleotide 

1 0 binding domain or Leucine-rich repeats. Such conserved domains are less likely 
to be amenable to mutation. Other amino acid residues, however, (e.g.. those that 
are not conserved or only semi-conserved among CARD-3. CARD-4. CARD-5, 
or CARD-6 of various species) may not be essential for activity and thus are 
likely to be amenable to alteration. 

1 5 Accordingly, another aspect of the invention pertains to nucleic acid 

molecules encoding CARD-3, CARD-4, CARD-5, or CARD-6 proteins that 
contain changes in amino acid residues that are not essential for activity. Such 
CARD-3. CARD-4. CARD-5. or CARD-6 proteins differ in amino acid sequence 
from SEQ ID NO:2. SEQ ID NO:8. SEQ ID NO:25. SEQ ID NO:39. SEQ ID 

2 0 NO:41. SEQ ID NO:43. SEQ ID NO:49. SEQ ID NO:52. SEQ ID NO:55. or 
SEQ ID NO:61, and yet retain biological activity. In one embodiment, the 
isolated nucleic acid molecule includes a nucleotide sequence encoding a protein 
that includes an amino acid sequence that is at least about 45% identical. 65%. 
75%. 85%. 95%. or 98% identical to the amino acid sequence of SEQ ID NO:2. 

25 SEQ ID NO:8. SEQ ID NO:26. SEQ ID NO:39. SEQ ID NO:41 . SEQ ID NO:43. 
SEQ ID NO:49. SEQ ID NO:52. SEQ ID NO:55. or SEQ ID NO:61 . 

An isolated nucleic acid molecule encoding a CARD-3. CARD-4. 
CARD-5. or CARD-6 protein having a sequence which differs from that of SEQ) 
ID NO: 1 . SEQ ID NO:3. SEQ ID NO:7. SEQ ID NO:9. SEQ ID:25. SEQ ID 

3C NO:27. SEQ ID NO:38. SEQ ID NO:40, SEQ ID NO:42. SEQ ID NO:48. SEQ 
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ID NO:50, SEQ ID NO:51, SEQ ID NO:53, SEQ ID NO:54, SEQ ID NO:56, 
SEQ ID NO:60, SEQ ID NO:62, the cDNA-of ATCC 203037, the cDNA of 
ATCC 203035. the cDNA of ATCC 203036, the cDNA of ATCC PTA-21 1, the 
cDNA of ATCC PTA-21 2, or the cDNA of ATCC PTA-21 3, can be created by 
5 introducing one or more nucleotide substitutions, additions or deletions into the 
nucleotide sequence of CARD-3 (SEQ ID NO: 1 , SEQ ID NO:3. the cDN A of 
ATCC 203037) or CARD-4L (SEQ ID NO:7, SEQ ID NO:9, the cDNA of ATCC 
203035 ). or CARD-4S (SEQ ID NO:25, SEQ ID NO:27. the cDNA of ATCC 
203036), or human CARD-4 splice variants (SEQ ID NO:38, SEQ ID NO:40. or 

10 murine CARD-4 (SEQ ID NO:42). or murine CARD-5 (SEQ ID NO:60. SEQ ID 
NO:62. the cDNA of PTA-21 1). or human CARD-5 (SEQ ID NO:48. SEQ ID 
NO:50. the cDNA of ATCC PTA-21 3). rat CARD-6 (SEQ ID NO:51. SEQ ID 
NO:53. the cDNA of ATCC PTA-21 1 ). or human CARD-6 (SEQ ID NO:54. 
SEQ) ID NO:56. the cDNA of ATCC PTA-21 3) such that one or more amino acid 

1 5 substitutions, additions or deletions are introduced into the encoded protein- 
Mutations can be introduced by standard techniques, such as site-directed 
mutagenesis and PCR-mediated mutagenesis. Preferably, conservative amino 
acid substitutions are made at one or more predicted non-essential amino acid 
residues. Thus, for example. 1%. 2%. 3%. 5%. or 10% of the amino acids can be 

2 0 replaced by conservative substitution. A "conservative amino acid substitution" 
is one in which the amino acid residue is replaced with an amino acid residue 
having a similar side chain. Families of amino acid residues having similar side 
chains have been defined in the art. These families include amino acids with 
basic side chains (e.g.. lysine, arginine. histidine). acidic side chains (e.g.. 

2 5 aspartic acid, glutamic acid), uncharged polar side chains (e.g., glycine. 

asparagine. glutamine. serine, threonine, tyrosine, cysteine), nonpolar side chains 
(e.g.. alanine, valine, leucine, isoleucine, proline, phenylalanine, methionine, 
tryptophan), beta-branched side chains (e.g.. threonine, valine, isoleucine) and 
aromatic side chains (e.g.. tyrosine, phenylalanine, tryptophan, histidine). Thus. 

3 0 a predicted nonessential amino acid residue in CARD-3. CARD-4. CARD-5. or 
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CARD-6 is preferably replaced with another amino acid residue from the same 
side chain family. Alternatively, mutations can be introduced randomly along all 
or part of a CARD-3, CARD-4, CARD-5, or CARD-6 coding sequence, such as 
by saturation mutagenesis, and the resultant mutants can be screened for CARD- 
5 3. CARD-4, CARD-5, or CARD-6 biological activity to identify mutants that 
retain activity. Following mutagenesis, the encoded protein can be expressed 
recombinantly and the activity of the protein can be determined. 

In an embodiment, a mutant CARD-3. CARD-4. CARD-5. or CARD- 
6 protein can be assayed for: (1 ) the ability to form protein:protein interactions 

1 0 with proteins in the apoptotic signaling pathway; (2) the ability to bind a CARD- 
3. CARD-4. CARD-5. or CARD-6 ligand; or (3) the ability to bind to an 
intracellular target protein. For example, (1) in Example 7. a two-hybrid 
screening assay for the physical interaction of CARD-3 and CARD-4 is shown. 
(2) in Example 8. a two-hybrid system assay for the interaction between CARD-4 

15 and its ligand hNUDC is described, and (3) in Example 12, a 

coimmunoprecipitation assay for the interaction of CARD-3 with its ligand 
CARD-4 is shown. In yet another embodiment, a mutant CARD-3. CARD-4, 
CARD-5. or CARD-6 protein can be assayed for the ability to modulate cellular 
proliferation, cellular differentiation, or cellular death. For example, in Example 

2 0 10. assays for the regulation of cellular death (apoptosis) by CARD-3 or CARD-4 
are described. In yet another embodiment, a mutant CARD-3 or CARD-4 protein 
can be assayed for regulation of a cellular signal transduction pathway. For 
example, in Example 9. an assay for the regulation by CARD-4 of the NF-kB 
pathway is described. 

2 5 The present invention encompasses antisense nucleic acid molecules. 

i.e.. molecules which are complementary to a sense nucleic acid encoding a 
protein, e.g.. complementary to the coding strand of a double-stranded cDNA 
molecule or complementary to an mRNA sequence. Accordingly, an antisense 
nucleic acid can hydrogen bond to a sense nucleic acid. The antisense nucleic 

3 0 acid can be complementary to an entire CARD-3. CARD-4. CARD-5. or CARD- 
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6 coding strand, or to only a portion thereof, e.g.. all or part of the protein coding 
region (or open reading frame). An antisense nucleic acid molecule can be 
antisense to a noncoding region of the coding strand of a nucleotide sequence 
encoding CARD-3, CARD-4, CARD-5, or CARD-6. The noncoding regions ("5' 
5 and 3' untranslated regions") are the 5' and 3 1 sequences which flank the coding 
region and are not translated into amino acids. Given the coding strand 
sequences encoding CARD-3, CARD-4, CARD-5. and CARD-6 disclosed 
herein, antisense nucleic acids of the invention can be designed according to the 
rules of Watson and Crick base pairing. The antisense nucleic acid molecule can 

1 0 be complementary to the entire coding region of CARD-3, CARD-4. CARD-5. or 

CARD-6L/S mRNA. but more preferably is an oligonucleotide which is antisense 
to only a portion of the coding or noncoding region of CARD-3. CARD-4. 
CARD-5, or CARD-6 mRNA. For example, the antisense oligonucleotide can be 
complementary to the region surrounding the translation start site of CARD-3 
15 mRNA. e.g.. an oligonucleotide having the sequence 

CCCTGGTACTTGCCCCTCCGGTAG (SEQ ID NO:34) or 
CCTGGTACTTGCCCCTCC (SEQ ID NO:35) or of the CARD-4 L mRNA. e.g.. 
TCGTTAAGCCCTTGAAGACAGTG (SEQ ID NO:36) and 
TCGTTAGCCCTTGAAGACCAGTGAGTGTAG (SEQ ID NO:37) or of the 

2 0 human CARD-5 mRNA. e.g., TAGGACCTCGGTACCCGCGCGCGCG (SEQ 

ID NO:68) or CGCCGGCCCC TAGGACCTCGGTACC (SEQ ID NO:69). An 
antisense oligonucleotide can be. for example, about 5. 1 0. 1 5. 20. 25. 30. 35. 40. 
45 or 50 nucleotides in length. An antisense nucleic acid of the invention can be 
constructed using chemical synthesis and enzymatic ligation reactions using 

2 5 procedures known in the art. For example, an antisense nucleic acid (e.g.. an 

antisense oligonucleotide) can be chemically synthesized using naturally 
occurring nucleotides or variously modified nucleotides designed to increase the 
biological stability of the molecules or to increase the physical stability of the 
duplex formed between the antisense and sense nucleic acids, e.g.. 

3 0 phosphorothioate derivatives and acridine substituted nucleotides can be used. 

-47 - 

BNSDOCID <WO 0i0O6?6A2 I - 



WO 01/00826 



PCI7US00/17691 



Examples of modified nucleotides which can be used to generate the antisense 
nucleic acid include 5-fluorouracih 5-bromouracil, 5-chlorouracil, 5-iodouracil. 
hypoxanthine, xanthine, 4-acetylcytosine, 5-(carboxyhydroxylmethyl) uracil. 5- 
carboxymethylaminomethyl-2-thiouridine. 5-carboxymethylaminomethyluraciL 
5 dihydrouracil. beta-D-galactosylqueosine, inosine, N6-isopentenyladenine. 1- 
methylguaninc, 1-methylinosine. 2.2-dimethylguanine. 2-methyladenine, 2- 
methylguanine. 3-methylcytosine, 5-methylcytosine, N6-adenine. 7- 
methylguanine. 5-methylaminomethyluracil, 5-methoxyaminomethyl-2- 
thiouracil, beta-D-mannosylqueosine, 5-methoxycarhoxy methyl uracil, 5- 
1 0 methoxvuracil. 2-methylthio-N6-isopentenyladenine. uracilo-oxyacetic acid (v). 
wybutoxosine. pseudouraciL queosine, 2-thiocytosine, 5-methyl-2-thiouracil. 2- 
thiouracil 4-thiouracil. 5-methyluracil. uracilo-oxyacetic acid methylester. 
uracilo-oxyacetic acid (v). 5-methyl-2-thiouracil. 3-(3aino-3-N-2-carboxypropyl) 
uracil. (acp3)w, and 2,6-diaminopurine. Alternatively, the antisense nucleic acid 
1 5 can be produced biologically using an expression vector into which a nucleic acid 
has been subcloned in an antisense orientation (i.e.. RNA transcribed from the 
inserted nucleic acid will be of an antisense orientation to a target nucleic acid of 
interest, described further in the following subsection). 

The antisense nucleic acid molecules of the invention are typically 
2 0 administered to a subject or generated in situ such that they hybridize with or 
bind to cellular mRNA and/or genomic DNA encoding a CARD-3, CARD-4. 
CARD-5. or CARD-6 protein to thereby inhibit expression of the protein, e.g.. by 
inhibiting transcription and/or translation. The hybridization can be by 
conventional nucleotide complementarity to form a stable duplex, or. for 

2 5 example, in the case of an antisense nucleic acid molecule which binds to DNA 

duplexes, through specific interactions in the major groove of the double helix. 
An example of a route of administration of antisense nucleic acid molecules of 
the invention include direct injection at a tissue site. Alternatively, antisense 
nucleic acid molecules can be modified to targei selected cells and then 

3 0 administered systemically. For example, for systemic administration, antisense 
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molecules can be modified such that they specifically bind to receptors or 
antigens. expressed on a selected cell surface, e.g., by linking the antisense 
nucleic acid molecules to peptides or antibodies which bind to cell surface 
receptors or antigens. The antisense nucleic acid molecules can also be delivered 
5 to cells using the vectors described herein. To achieve sufficient intracellular 
concentrations of the antisense molecules, vector constructs in which the 
antisense nucleic acid molecule is placed under the control of a strong pol II or 
pol III promoter are preferred. 

An antisense nucleic acid molecule of the invention can be an a- 

10 anomeric nucleic acid molecule. An a-anomeric nucleic acid molecule forms 
specific double-stranded hybrids with complementary RNA in which, contrary to 
the usual B-units, the strands run parallel to each other (Gaultier et al. (1987) 
Nucleic Acids. Res. 15:6625-6641). The antisense nucleic acid molecule can 
also comprise a 2-o-methylribonucleotide (Inoue et al. (1987) Nucleic Acids Res. 

15 15:6131-6148) or a chimeric RNA-DNA analogue (Inoue et al. (1987) FEBS % 
Lett. 215:327-330). 

The invention also encompasses ribozymes. Ribozymes are catalytic 
RNA molecules with ribonuclease activity which are capable of cleaving a 
single-stranded nucleic acid, such as an mRNA. to which they have a 

2 0 complementary region. Thus, ribozymes (e.g.. hammerhead ribozymes 

(described in Haselhoff and Gerlach (1988) Nature 334:585-91 )) can be used to 
catalytically cleave CARD-3. CARD-4, CARD-5. or CARD-6 mRNA transcripts 
to thereby inhibit translation of CARD-3. CARD-4, CARD-5. or CARD-6 
mRNA. A ribozyme having specificity for a CARD-3. CARD-4. CARD-5. or 

2 5 CARD-6-encoding nucleic acid can be designed based upon the nucleotide 

sequence of a CARD-3. CARD-4. CARD-5. or CARD-6 cDNA disclosed herein. 
For example, a derivative of a Tetrahymena L-19 IVS RNA can be constructed 
in which the nucleotide sequence of the active site is complementary to the 
nucleotide sequence to be cleaved in a CARD-3. CARD-4. CARD-5. or CARD- 

.30 6-encoding mRNA. Sec. e.g.. Cech et al. U.S. Patent No. 4.987.071 : and Cech el 
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al. U.S. Patent No. 5,1 16,742. Alternatively, CARD-3. CARD-4, CARD-5. or 
CARD-6 mRNA can be used to select a catalytic RNA having a specific 
ribonuclease activity from a pool of RNA molecules. See, e.g., Bartel and 
Szostak ( 1 993 ) Science 261:1411-1418. 
5 The invention also encompasses nucleic acid molecules which form 

triple helical structures. For example, CARD-3, CARD-4, CARD-5, or CARD-6 
gene expression can be inhibited by targeting nucleotide sequences 
complementary to the regulatory region of the CARD-3. CARD-4. CARD-5, or 
CARD-6 (e.g.. the CARD-3, CARD-4, CARD-5, or CARD-6 promoter and/or 

1 0 enhancers) to form triple helical structures that prevent transcription of the 
CARD-3. CARD-4, CARD-5, or CARD-6 gene in target cells. See generally, 
Helene (1991) Anticancer Drug Des. 6(6):569-84; Helene (1992) Ann. N.Y. 
Acad. Sci. 660:27-36: and Maher(1992) Bioassays 14(12):807-1 5. 

In embodiments, the nucleic acid molecules of the invention can be 

15 modified at the base moiety, sugar moiety or phosphate backbone to improve, 
e.g., the stability, hybridization, or solubility of the molecule. For example, the 
deoxyribose phosphate backbone of the nucleic acids can be modified to generate 
peptide nucleic acids (see Hyrup et al. (1996) Bioorganic & Medicinal Chemistry 
4( 1 ):5-23). As used herein, the terms "peptide nucleic acids" or "PNAs" refer to 

2 0 nucleic acid mimics, e.g.. DNA mimics, in which the deoxyribose phosphaic 
backbone is replaced by a pseudopeptide backbone and only the four natural 
nucleobases are retained. The neutral backbone of PNAs has been shown to 
allow for specific hybridization to DNA and RNA under conditions of low ionic 
strength. The synthesis of PNA oligomers can be performed using standard solid 

2 5 phase peptide synthesis protocols as described in Hyrup et al. (1996) supra: 

Perry-O'Keefe et al. (1996) Proc. Natl. Acad. Sci. USA 93:14670-675. 

PNAs of CARD-3. CARD-4. CARD-5. or CARD-6 can be used for 
therapeutic and diagnostic applications. For example. PNAs can be used as 
antisense or amigene agents for sequence-specific modulation of gene expression 

3 0 bv. e.g.. inducing transcription or translation arrest or inhibiting replication. 
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PNAs of CARD-3, CARD-4, CARD-5, or CARD-6 can also be used, e.g., in the 
analysis of single base pair mutations in a gene by, e.g., PNA directed PCR 
clamping; as artificial restriction enzymes when used in combination with other 
enzymes, e.g., SI nucleases (Hyrup (1996) supra; or as probes or primers for 
5 DNA sequence and hybridization (Hyrup (1996) supra; Perry-O'Keefe et al. 
(1996) Proc. Natl. Acad. Sci. USA 93: 14670-675). 

In another embodiment, PNAs of CARD-3, CARD-4, CARD-5, or 
CARD-6 can be modified, e.g., to enhance their stability or cellular uptake, by 
attaching lipophilic or other helper groups to PNA. by the formation of PNA- 

1 0 DNA chimeras, or by the use of liposomes or other techniques of drug delivery 
known in the art. For example, PNA-DNA chimeras of CARD-3, CARD-4, 
CARD-5, or CARD-6 can be generated which may combine the advantageous 
properties of PNA and DNA. Such chimeras allow DNA recognition enzymes, 
e.g., RNAse H and DNA polymerases, to interact with the DNA portion while the 

15 PNA portion would provide high binding affinity and specificity. PNA-DNA tt 
chimeras can be linked using linkers of appropriate lengths selected in terms of 
base stacking, number of bonds between the nucleobases. and orientation (Hyrup 
(1996) supra). The synthesis of PNA-DNA chimeras can be performed as 
described in Hyrup (1996) supra and Finn et al. (1996) Nucleic Acids Research 

2 0 24( 17):3357-63. For example, a DNA chain can be synthesized on a solid 
support using standard phosphoramidite coupling chemistry and modified 
nucleoside analogs, e.g., 5'-(4-methoxytrityl )amino-5'-deoxy-thymidine 
phosphoramidite. can be used as a between the PNA and the 5' end of DNA (Mag 
et al. (1989) Nucleic Acid Res. 17:5973-88). PNA monomers are then coupled in 

2 5 a stepwise manner to produce a chimeric molecule with a 5' PNA segment and a 
3' DNA segment (Finn et al. ( 1996) Nucleic Acids Research 24(17):3357-63). 
Alternatively, chimeric molecules can be synthesized with a 5' DNA segment and 
a 3' PNA segment (Peterser et al. (1975) Bioorganic Med. Chem. Lett. 5:1119- 
11124). 

2 0 - • 
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In other embodiments, the oligonucleotide may include other 
appended groups such as peptides (e.g., for targeting host cell receptors in vivo), 
or agents facilitating transport across the cell membrane (see, e.g., Letsinger et al. 
(1989) Proc. Natl. Acad. Sci. USA 86:6553-6556; Lemaitre et al. (1987) Proc. 
5 Natl. Acad. Sci. USA 84:648-652; PCT Publication No. WO 88/09810) or the 
blood-brain barrier (see, e.g., PCT Publication No. W0 89/10134). In addition, 
oligonucleotides can be modified with hybridization-triggered cleavage agents 
(see. e.g.. Krol et al. (1 988) Bio/Techniques 6:958-976) or intercalating agents 
{see. e.g.. Zon (1988) Pharm. Res. 5:539-549). To this end, the oligonucleotide 
10 may be conjugated to another molecule, e.g., a peptide, hybridization triggered 
cross-linking agent, transport agent, hybridization-triggered cleavage agent, etc. 

II. Isolated CARD-3, CARD-4. CARD-5. and CARD-6 Proteins and Anti- 
CARD-3. CARD-4, CARD-5, and CARD-6 Antibodies. 

15 One aspect of the invention pertains to isolated CARD-3. CARD-4. 

CARD-5, and CARD-6 proteins, and biologically active portions thereof, as well 
as polypeptide fragments suitable for use as immunogens to raise anti-CARD-3. 
CARD-4. CARD-5, or CARD-6 antibodies. In one embodiment, native CARD- 
3, CARD-4. CARD-5, or CARD-6 proteins can be isolated from cells or tissue 

2 0 sources by an appropriate purification scheme using standard protein purification 
techniques. In another embodiment, CARD-3. CARD-4. CARD-5. or CARD-6 
proteins are produced by recombinant DNA techniques. Alternative to 
recombinant expression, a CARD-3. CARD-4. CARD-5. or CARD-6 protein or 
polypeptide can be synthesized chemically using standard peptide synthesis 

2 5 techniques. 

An "isolated" or "purified" protein or biologically active portion 
thereof is substantially free of cellular material or other contaminating proteins 
from the cell or tissue source from which the CARD-3. CARD-4. CARD-5. or 
CARD-6 protein is derived, or substantially free from chemical precursors or 

3 C other chemicals when chemically synthesized. The language "substantially free 
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of cellular material" includes preparations of CARD-3, CARD-4, CARD-5, or 
CARD-6 protein in which the protein is separated from cellular components of 
the cells from which it is isolated or recombinantly produced. Thus. CARD-3. 
CARD-4, CARD-5, or CARD-6 protein that is substantially free of cellular 
5 material includes preparations of CARD-3, CARD-4. CARD-5, or CARD-6 
protein having less than about 30%, 20%. 10%. or 5% (by dry weight) of non- 
CARD-3, CARD-4, CARD-5. or CARD-6 protein (also referred to herein as a 
"contaminating protein"). When the CARD-3, CARD-4. CARD-5. or CARD-6 
protein or biologically active portion thereof is recombinantly produced- it is also 

10 preferably substantially free of culture medium, i.e.. culture medium represents 
less than about 20%. 1 0%, or 5% of the volume of the protein preparation. When 
CARD-3, CARD-4, CARD-5. or CARD-6 protein is produced by chemical 
synthesis, it is preferably substantially free of chemical precursors or other 
chemicals, i.e., it is separated from chemical precursors or other chemicals which 

15 are involved in the synthesis of the protein. Accordingly such preparations of . 
CARD-3. CARD-4. CARD-5, or CARD-6 protein have less than about 30%. 
20%. 10%. 5% (by dry weight) of chemical precursors or non-CARD-3. CARD- 
4. CARD-5. or CARD-6 chemicals. 

Biologically active portions of a CARD-3. CARD-4. CARD-5. or 

2 0 CARD-6 protein include peptides comprising amino acid sequences sufficiently 
identical to or derived from the amino acid sequence of the CARD-3. CARD-4, 
CARD-5. or CARD-6 protein (e.g.. the amino acid sequence shown in SEQ ID 
NO:2. SEQ ID NO:8. SEQ ID NO:26, SEQ ID NO:39. SEQ ID NO:41 . SEQ ID 
NO:43. SEQ ID NO:49. SEQ ID NO:52. SEQ ID NO:55. or SEQ ID NO:61 ). 

2 5 which include less amino acids than the full length CARD-3. CARD-4. CARD-5. 

or CARD-6 protein, and exhibit at least one activity of a CARD-3. CARD-4. 
CARD-5. or CARD-6 protein. Typically, biologically active portions comprise a 
domain or motif with at least one activity of the CARD-3. CARD-4. CARD-5. or 
CARD-6 protein. A biologically active portion of a CARD-3. CARD-4. CARD- 

3 0 5. or CARD-6 protein can be a polypeptide which is. for example. 10. 25. 50. 100 
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or more amino acids in length. Preferred biologically active polypeptides include 
one or more identified CARD-3, CARD-4, CARD-5, or CARD-6 structural 
domains, e.g., the CARD domain (SEQ ID NO:6, SEQ ID NO: 1 0, SEQ ID 
NO:27. SEQ ID NO:66, SEQ ID NO:67, or SEQ ID NO:68). 
5 Moreover, other biologically active portions, in which other regions of 

the protein are deleted, can be prepared by recombinant techniques and evaluated 
for one or more of the functional activities of a native CARD-3, CARD-4, 
CARD-5, or CARD-6 protein. 

CARD-3. CARD-4, CARD-5. or CARD-6 protein has the amino acid 
10 sequence shown of SEQ ID NO:2. SEQ ID NO:8. SEQ ID NO:26. SEQ ID 
NO:39. SEQ ID NO:41 . SEQ ID NO:43, SEQ ID NO:49, SEQ ID NO:52. SEQ 
ID NO:55, or SEQ ID NO:6I. Other useful CARD-3, CARD-4. CARD-5, or 
CARD-6 proteins are substantially identical to SEQ ID NO:2, SEQ ID NO:8. 
SEQ ID NO:26. SEQ ID NO:39. SEQ ID NO:41, SEQ ID NO:43, SEQ ID 
15 NO:49, SEQ ID NO:52. or SEQ ID NO:55, or SEQ ID NO:6K and retain the 
functional activity of the protein of SEQ ID NO:2. SEQ ID NO:8. SEQ ID 
NO:26. SEQ ID NO:39. SEQ ID NO:41 , SEQ ID NO:43. SEQ ID NO:49. SEQ 
ID NO:52. SEQ ID NO:55. or SEQ ID NO:6h yet differ in amino acid sequence 
due to natural allelic variation or mutagenesis. CARD-3 and CARD-4 are 
2 0 involved in activating caspases in the apoptotic pathway. For example, in 
Example 1 0. CARD-4 is shown to enhance caspase 9 activity. 

A useful CARD-3. CARD-4. CARD-5. or CARD-6 protein is a 
protein which includes an amino acid sequence at least about 45%. preferably 
55%. 65%. 75%. 85%. 95%. or 99% identical to the amino acid sequence of SEQ 
2 5 ID NO:2. SEQ ID NO:8. SEQ ID NO:26. SEQ ID NO:39, SEQ ID NO:41 . SEQ 
ID NO:43. SEQ ID NO:49, SEQ ID NO:52. SEQ ID NO:55. or SEQ ID NO:61 . 
and retains the functional activity of the CARD-3. CARD-4. CARD-5. or CARD- 
6 proteins of SEQ ID NO:2. SEQ ID NO:8. SEQ ID NO:26. SEQ ID NO:39. 
SEQ ID NO:41. SEQ ID NO:43. SEQ ID NO:49. SEQ ID NO:52. SEQ ID 
3C NO:55.orSEQIDNO:61. 
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To determine the percent identity of two amino acid sequences or of 
two nucleic acids, the sequences are aligned for optimal comparison purposes 
(e.g.. gaps can be introduced in the sequence of a first amino acid or nucleic acid 
sequence for optimal alignment with a second amino or nucleic acid sequence). 
5 The amino acid residues or nucleotides at corresponding amino acid positions or 
nucleotide positions are then compared. When a position in the first sequence is 
occupied by the same amino acid residue or nucleotide as the corresponding 
position in the second sequence, then the molecules are identical at that position. 
The percent identity between the two sequences is a function of the number of 
•I 0 identical positions shared by the sequences (i.e., % identity = # of identical 
positions/total # of positions x 100). 

The determination of percent homology between two sequences can 
be accomplished using a mathematical algorithm. A preferred, non-limiting 
example of a mathematical algorithm utilized for the comparison of two 

1 5 sequences is the algorithm of Karlin and Altschul (1990) Proc. Nat'l Acad. Sci. \ 

USA 87:2264-2268. modified as in Karlin and Altschul (1993) Proc. Nat'l Acad. 
Sci. USA 90:5873-5877. Such an algorithm is incorporated into the NBLAST 
and XBLAST programs of Altschul, et al. (1990) J. Mol. Biol. 215:403-410. 
BLAST nucleotide searches can be performed with the NBLAST program, score 

2 0 =100. wordlength = 12 to obtain nucleotide sequences similar or homologous to 

CARD-3. CARD-4, CARD-5. or CARD-6 nucleic acid molecules of the 
invention. For example. Example 5 describes the use of the TBLASTN program 
to query a database of sequences of full length and partial cDNA sequences with 
the human CARD-4 polypeptide sequence leading to the discovery of murine 

2 5 CARD-4 and Example 4 describes the use of BLASTN to query a proprietary 

EST database with the 5' untranslated sequence of CARD-4 leading to the 
discover}' of two human CARD-4 splice variants. BLAST protein searches can 
be performed with the XBLAST program, score = 50. wordlength = 3 to obtain 
amino acid sequences homologous to CARD-3. CARD-4. CARD-5. or CARD-0 

3 0 protein molecules of the invention. To obtain gapped alignments for comparison 
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purposes. Gapped BLAST can be utilized as described in Altschul et al. (1997) 
Nucleic Acids Res. 25:3389-3402. When utilizing BLAST and Gapped BLAST 
programs, the default parameters of the respective programs (e.g., XBLAST and 
NBLAST) can be used. See http://vvww.ncbi.nlm.nih.gov. Another preferred. 
5 non-limiting example of a mathematical algorithm utilized for the comparison of 
sequences is the algorithm of Myers and Miller. CABIOS (1989). Such an 
algorithm is incorporated into the ALIGN program (version 2.0) which is pan of 
the GCG sequence alignment software package. When utilizing the ALIGN 
program for comparing amino acid sequences, a PAM120 weight residue table, a 
10 gap length penalty of 12. and a gap penalty of 4 can be used. 

The percent identity between two sequences can be determined using 
techniques similar to those described above, with or without allowing gaps. In 
calculating percent identity, typically exact matches are counted. 

The invention also provides CARD-3. CARD-4. CARD-5. or CARD- 
15 6 chimeric or fusion proteins. As used herein, a CARD-3. CARD-4. CARD-5. or 
CARD-6 "chimeric protein" or "fusion protein" comprises a CARD-3. CARD-4. 
CARD-5. or CARD-6 polypeptide operatively linked to a non-CARD-3. CARD- 
4. CARD-5. or CARD-6 polypeptide. A "CARD-3. CARD-4. CARD-5. or 
CARD-6 polypeptide" refers to a polypeptide having an amino acid sequence 
2 0 corresponding 10 all or a portion (preferably a biologically active portion) of a 
CARD-3, CARD-4. CARD-5. or CARD-6. whereas a "non-CARD-3. CARD-4. 
CARD-5. or CARD-6 polypeptide" refers to a polypeptide having an amino acid 
sequence corresponding to a protein which is not substantially identical to the 
CARD-3. CARD-4. CARD-5. or CARD-6 protein, e.g.. a protein which is 

2 5 different from the CARD-3. CARD-4. CARD-5. or CARD-6 proteins and which 

is derived from the same or a different organism. Within the fusion protein, the 
term "operatively linked" is intended to indicate that the CARD-3. CARD-4, 
CARD-5. or CARD-6 polypeptide and the non-CARD-3. CARD-4. CARD-5. or 
CARD-6 polypeptide are fused in-frame to each other. The heterologous 

3 0 polypeptide can be fused to the N-terminus or C-terminus of the CARD-3. 
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CARD-4, CARD-5, or CARD-6 polypeptide. 

One useful fusion protein is a GST fusion protein in which the 
CARD-3, CARD-4, CARD-5, or CARD-6 sequences are fused to the C-terminus 
of the GST sequences. Such fusion proteins can facilitate the purification of 
5 recombinant CARD-3, CARD-4, CARD-5, or CARD-6. In another embodiment, 
the fusion protein contains a signal sequence from another protein. In certain 
host cells (e.g., mammalian host cells), expression and/or secretion of CARD-3, 
CARD-4. CARD-5. or CARD-6 can be increased through use of a heterologous 
signal sequence. For example, the gp67 secretory sequence of the baculovirus 

10 envelope protein can be used as a heterologous signal sequence (Current 

Protocols in Molecular Biology, Ausubel et al., eds.. John Wiley & Sons. 1992 ). 
Other examples of eukaryotic heterologous signal sequences include the secretory 
sequences of melittin and human placental alkaline phosphatase (Stratagene; La 
Jolla, California). In yet another example, useful prokaryotic heterologous signal 

1 5 sequences include the phoA secretory signal (Molecular cloning. Sambrook et al. 
second edition. Cold spring harbor laboratory press, 1989) and the protein A 
secretory signal (Pharmacia Biotech; Piscataway. New Jersey). 

In yet another embodiment, the fusion protein is a CARD-3. CARD-4. 
CARD-5. or CARD-6-immunoglobulin fusion protein in which all or part of 

2 0 CARD-3. CARD-4, CARD-5, or CARD-6 is fused to sequences derived from a 
member of the immunoglobulin protein family. The CARD-3. CARD-4. CARD- 
5. or CARD-6-immunoglobulin fusion proteins of the invention can be 
incorporated into pharmaceutical compositions and administered to a subject to 
inhibit an interaction between a CARD-3. CARD-4, CARD-5. or CARD-6 ligand 

2 5 and a CARD-3. CARD-4. CARD-5. or CARD-6 protein on the surface of a cell. 

to thereby suppress CARD-3, CARD-4. CARD-5. or CARD-6-mediated signal 
transduction in vivo. The CARD-3. CARD-4. CARD-5. or CARD- 
6-immunoglobulin fusion proteins can be used to affect the bioavailability of a 
CARD-3. CARD-4. CARD-5, or CARD-6 cognate ligand. Inhibition of the 

3 0 CARD-3 ligand/CARD-3. CARD-4 ligand/CARD-4. CARD-5 ligand/CARD-5. 
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or CARD-6 ligand/CARD-6 interaction may be useful therapeutically for both 
the treatment of proliferative and differentiativc disorders, as well as modulating 
(e.g., promoting or inhibiting) cell survival. Moreover, the CARD-3, CARD-4, 
CARD-5, or CARD-6-immunoglobulin fusion proteins of the invention can be 
5 used as immunogens to produce anti-CARD-3, CARD-4, CARD-5, or CARD-6 
antibodies in a subject, to purify CARD-3, CARD-4. CARD-5. or CARD-6 
ligands and in screening assays to identify molecules which inhibit the interaction 
of CARD-3. CARD-4. CARD-5. or CARD-6 with a CARD-3. CARD-4. CARD- 
5. or CARD-6 iigand. 

1 0 Preferably, a CARD-3. CARD-4, CARD-5, or CARD-6 chimeric or 

fusion protein of the invention is produced by standard recombinant DNA 
techniques. For example, DNA fragments coding for the different polypeptide 
sequences are ligated together in-frame in accordance with conventional 
techniques, for example by employing blunt-ended or stagger-ended termini for 

15 ligation, restriction enzyme digestion to provide for appropriate termini, filling-in 
of cohesive ends as appropriate, alkaline phosphatase treatment to avoid 
undesirable joining, and enzymatic ligation, in another embodiment, the fusion 
gene can be synthesized by conventional techniques including automated DNA 
synthesizers. Alternatively. PCR amplification of gene fragments can be carried 

2 0 out using anchor primers which give rise to complementary overhangs between 
two consecutive gene fragments which can subsequently be annealed and 
reamplified to generate a chimeric uene sequence (see. e.g.. Current Protocols in 
Molecular Biology. Ausubel et al. eds., John Wiley &. Sons: 1992). Moreover, 
many expression vectors are commercially available that already encode a fusion 

2 5 moiety (e.g., a GST polypeptide). A CARD-3. CARD-4. CARD-5. or CARD-6- 
encoding nucleic acid can be cloned into such an expression vector such that the 
fusion moiety is linked in-frame to the CARD-3. CARD-4. CARD-5, or CARD-6 
protein. 

The present invention also pertains to variants of the CARD-3. 
2 0 CARD-4. CARD-5. or CARD-6 proteins which function as either CARD-3. 
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CARD-4. CARD-5. or CARD-6 agonists (mimetics) or as CARD-3. CARD-4. 
CARD-5. or CARD-6 antagonists. Variants of the CARD-3, CARD-4. CARD-5. 
or CARD-6 protein can be generated by mutagenesis, e.g., discrete point 
mutation or truncation of the CARD-3, CARD-4, CARD-5, or CARD-6 protein. 
5 An agonist of the CARD-3, CARD-4, CARD-5, or CARD-6 protein can retain 
substantially the same, or a subset, of the biological activities of the naturally 
occurring form of the CARD-3, CARD-4, CARD-5, or CARD-6 protein. An 
antagonist of the CARD-3, CARD-4. CARD-5. or CARD-6 protein can inhibit 
one or more of the activities of the naturally occurring form of the CARD-3, 

1 0 CARD-4. CARD-5. or CARD-6 protein by. for example, competitively binding 
to a downstream or upstream member of a cellular signaling cascade which 
includes the CARD-3. CARD-4. CARD-5. or CARD-6 protein. Thus, specific 
biological effects can be elicited by treatment with a variant of limited function. 
Treatment of a subject with a variant having a subset of the biological activities 

15 of the naturally occurring form of the protein can have fewer side effects in a 
subject relative to treatment with the naturally occurring form of the CARD-3. 
CARD-4. CARD-5. or CARD-6 proteins. 

. Variants of the CARD-3, CARD-4. CARD-5. or CARD-6 protein 
which function as either CARD-3. CARD-4. CARD-5. or CARD-6 agonists 

2 0 (mimetics) or as CARD-3. CARD-4. CARD-5. or CARD-6 antagonists can be 
identified by screening combinatorial libraries of mutants, e.g.. truncation 
mutants of the CARD-3. CARD-4. CARD-5. or CARD-6 protein for CARD-3. 
CARD-4. CARD-5. or CARD-6 protein agonist or antagonist activity. In one 
embodiment, a variegated library of CARD-3. CARD-4. CARD-5. or CARD-6 

2 5 variants is generated by combinatorial mutagenesis at the nucleic acid level and is 

encoded by a variegated gene library. A variegated library of CARD-3. CARD- 
4. CARD-5. or CARD-6 variants can be produced by. for example, enzymaticaliy 
ligating a mixture of synthetic oligonucleotides into gene sequences such that a 
degenerate set of potential CARD-3. CARD-4. CARD-5. or CARD-o sequences 

3 0 is expressible as individual polypeptides, or alternatively, as a set of larger fusion 
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proteins (e.g., for phage display) containing the set of CARD-3, CARD-4, 
CARD- 5. or CARD-6 sequences therein. There are a variety of methods which 
can be used to produce libraries of potential CARD-3. CARD-4, CARD-5, or 
CARD-6 variants from a degenerate oligonucleotide sequence. Chemical 
5 synthesis of a degenerate gene sequence can be performed in an automatic DNA 
synthesizer, and the synthetic gene then ligated into an appropriate expression 
vector. Use of a degenerate set of genes allows for the provision, in one mixture, 
of all of the sequences encoding the desired set of potential CARD-3, CARD-4, 
CARD-5. or CARD-6 sequences. Methods for synthesizing degenerate 

10 oligonucleotides are known in the art (see, e.g.. Narang (1983) Tetrahedron 39:3; 
ltakura et al. (1984) Annu. Rev. Biochem. 53:323; ltakura et al. (1984) Science 
198:1056; Ike et al. (1983) Nucleic Acid Res, 1 1:477). 

Useful fragments of CARD-3. CARD-4. CARD-5. and CARD-6. 
include fragments comprising or consisting of a domain or subdomain described 

15 herein, e.g.. a kinase domain or a CARD domain. 

In addition, libraries of fragments of the CARD-3. CARD-4. CARD- 
5. or CARD-6 protein coding sequence can be used to generate a variegated 
population of CARD-3. CARD-4. CARD-5. or CARD-6 fragments for screening 
and subsequent selection of variants of a CARD-3. CARD-4. CARD-5. or 

2 0 CARD-6 protein. In one embodiment, a library of coding sequence fragments 
can be generated by treating a double stranded PCR fragment of a CARD-3. 
CARD-4. CARD-5, or CARD-6 coding sequence with a nuclease under 
conditions wherein nicking occurs only about once per molecule, denaturing the 
double stranded DNA. renaturing the DNA to form double stranded DNA which 

2 5 can include sense/antisense pairs from different nicked products, removing single 

stranded portions from reformed duplexes by treatment with SI nuclease, and 
ligating the resulting fragment library into an expression vector. By this method, 
an expression library can be derived which encodes N-terminal and internal 
fragments of various sizes of the CARD-3. CARD-4. CARD-5. or CARD-6 

3 0 protein. 
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Several techniques are known in the art for screening gene products of 
combinatorial libraries made by point mutations or truncation, and for screening 
cDNA libraries for gene products having a selected property. Such techniques 
are adaptable for rapid screening of the gene libraries generated by the 
5 combinatorial mutagenesis of CARD-3. CARD-4, CARD-5. or CARD-6 
proteins. The most widely used techniques, which are amenable to high 
through-put analysis, for screening large gene libraries typically include cloning 
the gene library into replicable expression vectors, transforming appropriate cells 
with the resulting library of vectors, and expressing the combinatorial genes 

1 0 under conditions in which detection of a desired activity facilitates isolation of 
the vector encoding the gene whose product was detected. Recursive ensemble 
mutagenesis (REM), a. technique which enhances the frequency of functional 
mutants in the libraries, can be used in combination with the screening assays to 
identify CARD-3. CARD-4. CARD-5, or CARD-6 variants (Arkin and Yourvan 

15 (1992) Proc. Natl. Acad. Sci. USA 89:7811-7815: Delgrave et al. (1993) Protein 
Engineering 6(3):327-33 1 ). 

An isolated CARD-3. CARD-4. CARD-5. or CARD-6 protein, or a 
portion or fragment thereof, can be used as an immunogen to generate antibodies 
that bind CARD-3. CARD-4. CARD-5. or CARD-6 using standard techniques for 

2 0 polyclonal and monoclonal antibody preparation. The full-length CARD-3. 
CARD-4. CARD-5. or CARD-6 protein can be used or. alternatively, the 
invention provides antigenic peptide fragments of CARD-3. CARD-4, CARD-5. 
or CARD-6 for use as immunogens. The antigenic peptide of CARD-3. CARD-4. 
CARD-5. or CARD-6 comprises at least 8 (preferably 10. 15, 20. or 30) amino 

2 5 acid residues of the amino acid sequence shown in SEQ ID NO:2. SEQ ID NO:8. 

SEQ ID NO:26. SEQ ID NO:39, SEQ ID NO:4 1 . SEQ ID NO:43. SEQ ID 
NO:49. SEQ ID NO:52. SEQ ID NO:55. or SEQ ID NO:61 or polypeptides 
including amino acids 128-139 or 287-298 of human CARD-4L and encompasses 
an epitope of CARD-3. CARD-4. CARD-5. or CARD-6 such that an antibody 

3 0 raised against the peptide forms a specific immune complex with CARD-3. 
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CARD-4, CARD-5, or CARD-6. 

Useful antibodies include antibodies which bind to a domain or 
subdomain of CARD-3, CARD-4, CARD-5, or CARD-6 described herein (e.g., a 
kinase domain, a CARD domain, or a leucine-rich domain). 
5 Preferred epitopes encompassed by the antigenic peptide are regions 

of CARD-3, CARD-4, CARD-5, or CARD-6 that are located on the surface of 
the protein, e.g., hydrophilic regions. Other important criteria include a 
preference for a terminal sequence, high antigenic index (e.g.. as predicted by 
Jameson- Wolf algorithm), ease of peptide synthesis (e.g.. avoidance of prolines): 
10 and high surface probability (e.g.. as predicted by the Emini algorithm; Figure S 
and Figure 9). 

A CARD-3, CARD-4, CARD-5, or CARD-6 immunogen typically is 
used to prepare antibodies by immunizing a suitable subject, (e.g., rabbit, goat, 
mouse or other mammal) with the immunogen. An appropriate immunogenic 

15 preparation can contain, for example, recombinantly expressed CARD-3. CARD- 
4. CARD-5. or CARD-6 protein or a chemically synthesized CARD-3. CARD-4. 
CARD-5. or CARD-6 polypeptide. The preparation can further include an 
adjuvant, such as Freund's complete or incomplete adjuvant, or similar 
immunostimulatory agent. Immunization of a suitable subject with an 

2 0 immunogenic CARD-3. CARD-4, CARD-5. or CARD-6 preparation induces a 
polyclonal anti-CARD-3. CARD-4, CARD-5. or CARD-6 antibody response. 
For example, polypeptides including amino acids 128-139 or 287-298 of human 
CARD-4L were conjugated to KLH and the resulting conjugates were used to 
immunize rabbits and polyclonal antibodies that specifically recognize the two 

2 5 immunogen peptides were generated. 

Accordingly, another aspect of the invention pertains to anti-CARD-3. 
CARD-4. CARD-5, or CARD-6 antibodies. The term "antibody" as used herein 
refers to immunoglobulin molecules and immunologically active portions of 
immunoglobulin molecules, i.e.. molecules that contain an antigen binding site 

3 0 which specifically binds an antigen, such as CARD-3. CARD-4. CARD-5. or 
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CARD-6. A molecule which specifically binds to CARD-3, CARD-4. CARD-5, 
or CARD-6 is a molecule which binds CARD-3, CARD-4, CARD-5, or CARD- 
6, but does not substantially bind other molecules in a sample, e.g.. a biological 
sample, which naturally contains CARD-3, CARD-4. CARD-5. or CARD-6. 
5 Examples of immunologically active portions of immunoglobulin molecules 
include F(ab) and F(ab')2 fragments which can be generated by treating the 
antibody with an enzyme such as pepsin. The invention provides polyclonal and 
monoclonal antibodies that bind CARD-3, CARD-4, CARD-5, or CARD-6. The 
term "monoclonal antibody" or "monoclonal antibody composition", as used 

1 0 herein.- refers to a population of antibody molecules that contain only one species 

of an antigen binding site capable of immunoreacting with a particular epitope of 
CARD-3, CARD-4. CARD-5. or CARD-6. A monoclonal antibody composition 
thus Typically displays a single binding affinity for a particular CARD-3. CARD- 
4. CARD-5. or CARD-6 protein with which it immunoreacts. 
15 Polyclonal anti-CARD-3, CARD-4, CARD-5. or CARD-6 antibodies 

can be prepared as described above by immunizing a suitable subject with a 
CARD-3. CARD-4. CARD-5. or CARD-6 immunogen. The anti-CARD-3. 
CARD-4. CARD-5. or CARD-6 antibody titer in the immunized subject can be 
monitored over time by standard techniques, such as with an enzyme linked 

2 0 immunosorbent assay (EL1SA) using immobilized CARD-3. CARD-4. CARD-5. 

or CARD-6. If desired, the antibody molecules directed against CARD-3. 
CARD-4, CARD-5, or CARD-6 can be isolated from the mammal (e.g.. from the 
blood) and further purified by well-known techniques, such as protein A 
chromatography to obtain the IgG fraction. At an appropriate time after 

2 5 immunization, e.g.. when the anti-CARD-3. CARD-4. CARD-5. or CARD-6 

antibody titers are highest, antibody-producing cells can be obtained from the 
subject and used to prepare monoclonal antibodies by standard techniques, such 
as the hybridoma technique originally described by Kohler and Milsiein ( 1975) 
Nature 256:495-497. the human B cell hybridoma technique (kozbor et a!. ( 1983) 

3 0 Immunol Today 4:72). the EBV-hybridoma technique (Cole ei al. 1 1985). 
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Monoclonal Antibodies and Cancer Therapy, Alan R. Liss, Inc., pp. 77-96) or 
trioma techniques. The technology for producing various antibodies monoclonal 
antibody hybridomas is well known (see generally Current Protocols in 
Immunology (1994) Coligan et al. (eds.) John Wiley & Sons. Inc.. New York. 
5 NY). Briefly, an immortal cell line (typically a myeloma) is fused to 

lymphocytes (typically splenocytes) from a mammal immunized with a CARD-3. 
CARD-4, CARD-5, or CARD-6 immunogen as described above, and the culture 
supernatants of the resulting hybridoma cells are screened to identify a 
hybridoma producing a monoclonal antibody that binds CARD-3. CARD-4. 

1 0 CARD-5. or CARD-6. 

Any of the many well known protocols used for fusing lymphocytes 
and immortalized cell lines can be applied for the purpose of generating an 
anti-CARD-3, CARD-4. CARD-5, or CARD-6 monoclonal antibody (see. e.g., 
Current Protocols in Immunology, supra; Galfre et al. (1977) Nature 266:55052; 

15 R.H. Kenneth, in Monoclonal Antibodies: A New Dimension In Biological 
Analyses, Plenum Publishing Corp., New York, New York (1 980); and Lerner 
(1981) Yale J. Biol. Med.. 54:387-402). Moreover, the ordinarily skilled worker 
will appreciate that there are many variations of such methods which also would 
be useful. Typically, the immortal cell line (e.g.. a myeloma cell line) is derived 

2 0 from the same mammalian species as the lymphocytes. For example, murine 
hybridomas can be made by fusing lymphocytes from a mouse immunized with 
an immunogenic preparation of the present invention with an immortalized 
mouse cell line. e.g.. a myeloma cell line that is sensitive to culture medium 
containing hypoxanthine. aminopterin and thymidine ("HAT medium"). Any of a 

2 5 number of myeloma cell lines can be used as a fusion partner according to 

standard techniques, e.g.. the P3-NSl/l-Ag4-l. P3-x63-Ag8.653 or Sp2/0-Agl4 
myeloma lines. These myeloma lines are available from ATCC. Typically, 
HAT-sensitive mouse myeloma cells are fused to mouse splenocytes using 
polyethylene glycol ("PEG"). Hybridoma cells resulting from the fusion are then 

3 0 selected using HAT medium, which kills unfused and unproductively fused 
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myeloma cells (unfused splenocytes die after several days because they are not 
transformed). Hybridoma cells producing a monoclonal antibody of the 
invention are detected by screening the hybridoma culture supernatants for 
antibodies that bind CARD-3, CARD-4. CARD-5. or CARD-6, e.g.. using a 
5 standard ELISA assay. 

Alternative to preparing monoclonal antibody-secreting hybridomas. a 
monoclonal anti-CARD-3. CARD-4, CARD-5. or CARD-6 antibody can be 
identified and isolated by screening a recombinant combinatorial 
immunoglobulin library (e.g.. an antibody phage display library) with CARD-3. 

10 CARD-4. CARD-5. or CARD-6 to thereby isolate immunoglobulin library- 
members that bind CARD-3. CARD-4. CARD-5. or CARD-6. Kits for 
generating and screening phage display libraries are commercially available (e.g.. 
the Pharmacia Recombinant Phage Antibody System, Catalog No. 27-9400-01 ; 
and the Stratagene SurfZAP Phage Display Kit. Catalog No. 240612). 

15 Additionally, examples of methods and reagents particularly amenable for use in 
generating and screening antibody display library can be found in. for example. 
U.S. Patent No. 5.223.409; PCT Publication No. WO 92/18619: PCT Publication 
No. WO 91/17271; PCT Publication No. WO 92/20791; PCT Publication No. 
WO 92/15679; PCT Publication No. WO 93/01288; PCT Publication No. WO 

2 0 92/01047; PCT Publication No. WO 92/09690; PCT Publication No. WO 

90/02809; Fuchs et al. (1991) Bio/Technology 9:1370-1372; Hay et al. (1992) 
Hum. Antibod. Hybridomas 3:81-85; Huse et al. (1989) Science 246:1275-1281; 
Griffiths etal. (1993) EMBO J. 12:725-734. 

Additionally, recombinant anti-CARD-3. CARD-4. CARD-5. or 

2 5 CARD-6 antibodies, such as chimeric and humanized monoclonal antibodies. 

comprising both human and non-human portions, which can be made using 
standard recombinant DNA techniques, are within the scope of the invention. 
Such chimeric and humanized monoclonal antibodies can be produced by 
recombinant DNA techniques known in the an. for example using methods 

3 0 described in PCT Publication No. WO 87/02671 ; European Patent Application 
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184,187; European Patent Application 171,496; European Patent Application 
173,494; PCT Publication No. WO 86/01533; U.S. Patent No. 4,816,567; 
European Patent Application 125.023; Better et al. (1988) Science 240:1041- 
1043: Liu et al. (1987) Proc. Natl. Acad. Sci. USA 84:3439-3443; Liu et al. 
5 (1987) J. Immunol. 139:3521-3526; Sun et al. (1987) Proc. Natl. Acad. Sci. USA 
84:214-218; Nishimura et al. (1987) Cane. Res. 47:999-1005; Wood et al. (1985) 
Nature 314:446-449; and Shaw et al. (1988) J. Natl. Cancer Inst. 80:1553-1559); 
Morrison, (1985) Science 229:1202-1207; Oi et al. (1986) Bio/Techniques 4:214: 
U.S. Patent 5.225.539: Jones et al. (1986) Nature 321 :552-525; Verhoeyan et al. 

10 (1988) Science 239:1534; and Beidler et al. (1988) J. Immunol. 141:4053-4060. 

An anti-CARD-3, CARD-4, CARD-5. or CARD-6 antibody (e.g.. 
monoclonal antibody) can be used to isolate CARD-3. CARD-4. CARD-5, or 
CARD-6 by standard techniques, such as affinity chromatography or 
immunoprecipitation. An anti-CARD-3, CARD-4, CARD-5. or CARD-6 

15 antibody can facilitate the purification of natural CARD-3, CARD-4. CARD-5. or 
CARD-6 from cells and of recombinantly produced CARD-3, CARD-4, CARD- 
5. or CARD-6 expressed in host cells. Moreover, an anti-CARD-3. CARD-4. 
CARD-5. or CARD-6 antibody can be used to detect CARD-3. CARD-4. CARD- 
5. or CARD-6 protein (e.g.. in a cellular lysate or cell supernatant) in order to 

2 0 evaluate the abundance and pattern oi" expression of the CARD-3. CARD-4. 
CARD-5. or CARD-6 protein. Anti-CARD-3, CARD-4, CARD-5. or CARD-6 
antibodies can be used diagnostically to monitor protein levels in tissue as part of 
a clinical testing procedure, e.g.. to. for example, determine the efficacy of a 
given treatment regimen. Detection can be facilitated by coupling the antibody to 

2 5 a detectable substance. Examples of detectable substances include various 

enzymes, prosthetic groups, fluorescent materials, luminescent materials, 
bioluminescem materials, and radioactive materials. Examples of suitable 
enzymes include horseradish peroxidase, alkaline phosphatase. B-galactosidasc. 
or acetylcholinesterase: examples of suitable prosthetic group complexes include 

3 0 streptavidin/biotin and avidin/biotin: examples of suitable fluorescent materials 
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include umbelliferone, fluorescein, fluorescein isothiocyanate. rhodamine, 
dichlorotriazinylamine fluorescein, dansyl chloride or phycoerythrin; an example 
of a luminescent material includes luminol; examples of bioluminescent materials 
include luciferase, luciferin, and aequorin, and examples of suitable radioactive 
5 material include ,25 L I3I I, 35 S or 3 H. 

111. Recombinant Expression Vectors and Host Cells 

Another aspect of the invention pertains to vectors, preferably 
expression vectors, containing a nucleic acid encoding CARD-3, CARD-4. 
I o C'ARD-5. or CARD-6 (or a portion thereof). As used herein, the term "vector" 
refers to a nucleic acid molecule capable of transporting another nucleic acid to 
which it has been linked. One type of vector is a "plasmid". which refers to a 
circular double stranded DNA loop into which additional DN A segments can be 
ligated. Another type of vector is a viral vector, wherein additional DNA 

1 5 segments can be ligated into the viral genome. Certain vectors are capable of 

autonomous replication in a host cell into which they are introduced (e.g.. 
bacterial vectors having a bacterial origin of replication and episomal mammalian 
vectors). Other vectors (e.g., non-episomal mammalian vectors) are integrated 
into the genome of a host cell upon introduction into the host cell, and thereby are 

2 0 replicated along with the host genome. Moreover, certain vectors, expression 

vectors, are capable of directing the expression of genes to which they are 
operatively linked. In general, expression vectors of utility in recombinant DNA 
techniques are often in the form of plasmids (vectors). However, the invention is 
intended to include such other forms of expression vectors, such as viral vectors 

2 5 (e.g.. replication defective retroviruses, adenoviruses and adeno-associated 

viruses), which serve equivalent functions. 

The recombinant expression vectors of the invention comprise a 
nucleic acid of the invention in a form suitable for expression of the nucleic acid 
in a host cell, which means that the recombinant expression vectors include one 

3 0 or more regulatory sequences, selected on the basis of the host cells to be used for 
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expression, which is operatively linked to the nucleic acid sequence to be 
expressed. Within a recombinant expression vector, "operably linked" is 
intended to mean that the nucleotide sequence of interest is linked to the 
regulatory sequence(s) in a manner which allows for expression of the nucleotide 
5 sequence (e.g., in an in vitro transcription/translation system or in a host cell 
when the vector is introduced into the host cell). The term "regulatory sequence" 
is intended to include promoters, enhancers and other expression control elements 
(e.g.. polyadenylation signals). Such regulatory sequences are described, for 
example, in Goeddel: Gene Expression Technology: Methods in Enzymology 

10 185. Academic Press, San Diego. CA (1990). Regulatory sequences include 

those which direct constitutive expression of a nucleotide sequence in many types 
of host cell and those which direct expression of the nucleotide sequence only in 
certain host cells (e.g., tissue-specific regulatory sequences). It will be 
appreciated by those skilled in the art that the design of the expression vector can 

15 depend on such factors as the choice of the host cell to be transformed, the level 
of expression of protein desired, etc. The expression vectors of the invention can 
be introduced into host cells to thereby produce proteins or peptides, including 
fusion proteins or peptides, encoded by nucleic acids as described herein (e.g.. 
CARD-3. CARD-4. CARD-5, or CARD-6 proteins, mutant forms of CARD-3. 

2 0 CARD-4. CARD-5, or CARD-6, fusion proteins, etc.). 

The recombinant expression vectors of the invention can be designed 
for expression of CARD-3, CARD-4, CARD-5. or CARD-6 in prokaryolic or 
eukaryotic cells, e.g.. bacterial cells such as E. coli, insect cells (using 
baculovirus expression vectors) yeast cells or mammalian cells. Suitable host 

2 5 cells are discussed further in Goeddel. Gene Expression Technology: Methods in 

Enzymology 185. Academic Press. San Diego. CA (1990). Alternatively, the 
recombinant expression vector can be transcribed and translated in vitro, for 
example using T7 promoter regulatory sequences and T7 polymerase. 

Expression of proteins in prokaryotes is most often carried out in E. 

3 0 coli with vectors containing constitutive or inducible promoters directing the 
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expression of either fusion or non-fusion proteins. Fusion vectors add a number 
of amino acids to a protein encoded therein, usually to the amino terminus of the 
recombinant protein. Such fusion vectors typically serve three purposes: 1) to 
increase expression of recombinant protein; 2) to increase the solubility of the 
5 recombinant protein; and 3) to aid in the purification of the recombinant protein 
by acting as a ligand in affinity purification. Often, in fusion expression vectors, 
a proteolytic cleavage site is introduced at the junction of the fusion moiety and 
the recombinant protein to enable separation of the recombinant protein from the 
fusion moiety subsequent to purification of the fusion protein. Such enzymes, 

1 0 and their cognate recognition sequences, include Factor Xa. thrombin and 

enterokinase. Typical fusion expression vectors include pGEX (Pharmacia - 
Biotech Inc: Smith and Johnson ( 1 988) Gene 67:31-40), pMAL (New England 
Biolabs. Beverly. MA) and pRIT5 (Pharmacia. Piscataway, NJ) which fuse 
glutathione S-transferase (GST), maltose E binding protein, or protein A, 
1 5 respectively, to the target recombinant protein. 

Examples of suitable inducible non-fusion E. coli expression vectors 
include pTrc (Amann et al., (1988) Gene 69:301-315) and pET 1 Id (Studicr et 
al,. Gene Expression Technology: Methods in Enzymology 185. Academic Press. 
San Diego. California (1990) 60-89). Target gene expression from the pTrc 

2 0 vector relies on host RNA polymerase transcription from a hybrid trp-lac fusion 

promoter. Target gene expression from the pET 3 Id vector relies on transcription 
from a T7 gnlO-lac fusion promoter mediated by a coexpressed viral RNA 
polymerase (T7 gnl ). This viral polymerase is supplied by host strains 
BL21(DE3) or HMS174(DE3) from a resident k prophage harboring a T7 gnl 

2 5 gene under the transcriptional control of the )acUV5 promoter. 

One strategy to maximize recombinant protein expression in E. coli is 
to express the protein in a host bacteria with an impaired capacity to 
proteolytically cleave the recombinant protein (Goliesman. Gene Expression 
Technology: Methods in Enzymology 185. Academic Press. San Diego. 

3 0 California (1990) 1 19-128). Another strategy is-to alter the nucleic acid sequence 
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of the nucleic acid to be inserted into an expression vector so that the individual 
codons for each amino acid are those preferentially utilized in E, coli (Wada et al. 
(1992) Nucleic Acids Res. 20:21 1 1-21 18). Such alteration of nucleic acid 
sequences of the invention can be carried out by standard DNA synthesis 
5 techniques. 

In another embodiment, the CARD-3, CARD-4. CARD-5, or CARD- 
6 expression vector is a yeast expression vector. Examples of vectors for 
expression in yeast S. cerivisae include pYepSecl (Baldari et al. (1987) EMBO J. 
6:229-234). pMFa (Kurjan and Herskowitz, (1982) Cell 30:933-943), pJRY88 

1 o (Schultz et al. (1987) Gene 54:1 13-123), pYES2 (lnvitrogen Corporation. San 

Diego. CA), pGBT9 (Clontech, Palo Alto, CA), pGADIO (Clontech. Palo Alto. 
CA}. pYADE4 and pYGAE2 and pYPGE2 (Brunelli and Pall. (1993) Yeast 
9:1299-1308). pYPGEl 5 (Brunelli and Pall, (1993) Yeast 9:1309-1318), pACTII 
(Dr. S.E. Elledge. Baylor College of Medicine), and picZ (InVitrogen Corp, San 
15 Diego, CA). For example, in Example 7 the expression of a fusion protein 
comprising amino acids 1-145 of human CARD-4L fused to the DNA-binding 
domain of S. cerevisiae transcription factor GAL4 from the yeast expression 
vector pGBT9 is described. In another example. Example 8 describes the 
expression of a fusion protein comprising amino acids 406-953 of human CARD- 

2 0 4L fused to the DNA-binding domain of S. cerevisiae transcription factor GAL4 

from the yeast expression vector pGBT9. In yet another example. Example 7 
describes the expression of a fusion protein comprising CARD-3 fused to the 
transcriptional activation domain of S. cerevisiae transcription factor GAL4 from 
the yeast expression vector pACTII. 

2 5 Alternatively, CARD-3, CARD-4. CARD-5, or CARD-6 can be 

expressed in insect cells using baculovirus expression vectors. Baculovirus 
vectors available for expression of proteins in cultured insect cells (e.g.. Sf 9 
cells) include the pAc series (Smith et al. (1983) Mol. Cell Biol. 3:2156-2165) 
and the pVL series (Lucklow and Summers (1989) Virology 170:31-39). 

3 0 In yet another embodiment, a nucleic acid of the invention is 
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expressed in mammalian cells using a mammalian expression vector. Examples 
of mammalian expression vectors include pCDM8 (Seed (1987) Nature 329:840). 
pCI (Promega), and pMT2PC (Kaufman et al. (1987) EMBO J. 6:187-195). 
When used in mammalian cells, the expression vector's control functions are 
5 often provided by viral regulatory elements. For example, commonly used 

promoters are derived from polyoma, Adenovirus 2, cytomegalovirus and Simian 
Virus 40. For other suitable expression systems for both prokaryotic and 
eukaryotic cells see chapters 16 and 17 of Sambrook et al. (supra). For example. 
Example 9, Example 10. and Example 12 describe the expression of human 
10 CARD-4 or fragments thereof, CARD-3, or both from the mammalian expression 
vector pCL 

In another embodiment, the recombinant mammalian expression 
vector is capable of directing expression of the nucleic acid preferentially in a 
particular cell type (e.g.. tissue-specific regulator)' elements are used to express 

15 the nucleic acid). Tissue-specific regulatory elements are known in the art. Non- 
limiting examples of suitable tissue-specific promoters include the albumin 
promoter (liver-specific: Pinkerl et al. (1987) Genes Dev. 1:268-277), lymphoid- 
specific promoters (Calame and Eaton ( 1988) Adv. Immunol. 43:235-275). in 
particular promoters of T cell receptors (Winoto and Baltimore (1989) EMBO J. 

2 0 8:729-733) and immunoglobulins (Banerji et al. (1983) Cell 33:729-740: Queen 
and Baltimore (1983) Cell 33:741-748). neuron-specific promoters (e.g., the 
neurofilament promoter: Byrne and Ruddle (1989) Proc. Natl. Acad. Sci. USA 
86:5473-5477). pancreas-specific promoters (Edlund et al. (1985) Science 
230:912-916), and mammary gland-specific promoters (e.g., milk whey 

2 5 promoter; U.S. Patent No. 4.873.316 and European Application Publication No. 
264,166). Developmentallv-regulated promoters are also encompassed, for 
example the murine hox promoters (Kessel and Gruss (1990) Science 249:374- 
379) and the a-fetoproicm promoter (Cam pes and Tilghman (1989) Genes Dev. 
3:537-546). . . . * 

30 • . .... 
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The invention further provides a recombinant expression vector 
comprising a DNA molecule of the invention cloned into the expression vector in 
an antisense orientation. That is, the DNA molecule is operatively linked to a 
regulatory sequence in a manner which allows for expression (by transcription of 
5 the DNA molecule) of an RNA molecule which is antisense to CARD-3, CARD- 
4, CARD-5, or CARD-6 mRNA. Regulators' sequences operatively linked to a 
nucleic acid cloned in the antisense orientation can be chosen which direct the 
continuous expression of the antisense RNA molecule in a variety of cell types, 
for instance viral promoters and/or enhancers, or regulatory sequences can be 

10 chosen which direct constitutive, tissue specific or cell type specific expression of 
antisense RNA. The antisense expression vector can be in the form of a 
recombinant plasmid. phagemid or attenuated virus in which antisense nucleic 
acids are produced under the control of a high efficiency regulatory region, the 
activity of which can be determined by the cell type into which the vector is 

1 5 introduced. For a discussion of the regulation of gene expression using antisense 
genes see Weintraub et al. (Reviews - Trends in Genetics, Vol. 1(1)1 986). 

Another aspect of the invention pertains to host cells into which a 
recombinant expression vector of the invention or isolated nucleic acid molecule 
of the invention has been introduced. The terms "host cell" and "recombinant 

2 0 host cell" are used interchangeably herein. It is understood thai such terms refer 
not only to the particular subject cell but to the progeny or potential progeny of 
such a cell. Because certain modifications may occur in succeeding generations 
due to either mutation or environmental influences, such progeny may not, in 
fact, be identical to the parent cell, but are still included within the scope of the 

2 5 term as used herein. 

A host cell can be any prokaryotic or eukaryotic cell. For example, 
CARD-3. CARD-4. CARD-5, or CARD-6 protein can be expressed in bacterial 
cells such as E. coli. insect cells, yeast or mammalian cells (such as Chinese 
hamster ovary cells (CHO) or COS cells). Other suitable host cells are known to 

3 0 those skilled in the art. For example, in Example 7 a Saccharomyces cerevisiae 
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host cell for recombinant CARD-4 and CARD-3 expression is described, and in 
Examples 9, 10. and 12. a 293T host cells for expression of CARD-4 or 
fragments thereof or CARD-3 are described. 

Vector DNA or an isolated nucleic acid molecule of the invention can 
5 be introduced into prokaryotic or eukaryotic cells via conventional transformation 
or transfection techniques. As used herein, the terms "transformation" and 
"transfection" are intended to refer to a variety of art-recognized techniques for 
introducing foreign nucleic acid (e.g., DNA) into a host cell, including calcium 
phosphate or calcium chloride co-precipitation, DEAE-dextran-mediated 

1 0 transfection, lipofection, or electroporation. Suitable methods for transforming or 
transfecting host cells can be found in Sambrook, et al. (supra), and other ' 
laboratory manuals. 

For stable transfection of mammalian cells, it is known that, 
depending upon the expression vector and transfection technique used, only a 

15 small fraction of cells may integrate the foreign DNA into their genome. In some 
cases vector DNA is retained by the host cell. In other cases the host cell does 
not retain vector DNA and retains only an isolated nucleic acid molecule of the 
invention carried by the vector. In some cases, and isolated nucleic acid 
molecule of the invention is used to transform a cell without the use of a vector. 

2 0 In order to identify and select these integrants, a gene that encodes a 

selectable marker (e.g.. resistance to antibiotics) is generally introduced into the 
host cells along with the gene of interest. Preferred selectable markers include 
those which confer resistance to drugs, such as G418. hygromycin and 
methotrexate. Nucleic acid encoding a selectable marker can be introduced into a 

2 5 host cell on the same vector as that encoding CARD-3. CARD-4. CARD-5. or 

CARD-6 or can be introduced on a separate vector: Cells stably transfected with 
the introduced nucleic acid can be identified by drug selection (e.g.. cells that 
have incorporated the selectable marker gene will survive, while the other cells 
die). 

3 0 • 
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A host cell of the invention, such as a prokaryotic or eukaryotic host 
cell in culture, can be used to produce (i.e., express) a CARD-3. CARD-4, 
CARD-5. or CARD-6 protein. Accordingly, the invention further provides 
methods for producing CARD-3, CARD-4. CARD-5. or CARD-6 protein using 
5 the host cells of the invention. In one embodiment, the method comprises 
culturing the host cell of the invention (into which a recombinant expression 
vector or isolated nucleic acid molecule encoding CARD-3. CARD-4, CARD-5. 
or CARD-6 has been introduced) in a suitable medium such that CARD-3, 
CARD-4. CARD-5, or CARD-6 protein is produced. In another embodiment, the 

1 0 method further comprises isolating CARD-3, CARD-4, CARD-5, or CARD-6 
from the medium or the host cell. 

The host cells of the invention can also be used to produce nonhuman 
transgenic animals. For example, in one embodiment, a host cell of the invention 
is a fertilized oocyte or an embryonic stem cell into which CARD-3. CARD-4. 

1 5 CARD-5. or CARD-6-coding sequences have been introduced. Such host cells 
can then be used to create non-human transgenic animals in which exogenous 
CARD-3. CARD-4. CARD-5, or CARD-6 sequences have been introduced into 
their genome or homologous recombinant animals in which endogenous CARD- 
3. CARD-4, CARD-5. or CARD-6 sequences have been altered. Such animals 

2 0 are useful for studying the function and/or activity of CARD-3. CARD-4. CARD- 
5. or CARD-6 and for identifying and/or evaluating modulators of CARD-3. 
CARD-4, CARD-5. or CARD-6 activity. As used herein, a "transgenic animal" 
is a non-human animal, preferably a mammal, more preferably a rodent such as a 
rat or mouse, in which one or more of the cells of the animal includes a 

2 5 transgene. Other examples of transgenic animals include non-human primates, 
sheep, dogs. cows, goats, chickens, amphibians, etc. A transgene is exogenous 
DNA which is integrated into the genome of a cell from which a transgenic 
animal develops and which remains in the genome of the mature animal, thereby 
directing the expression of an encoded gene product in one or more cell types or 
3D tissues of the transgenic animal. As used herein, an "homologous recombinant 
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animal" is a non-human animal, preferably a mammal, more preferably a mouse, 
in which an endogenous CARD-3, CARD-4, CARD-5, or CARD-6 gene has 
been altered by homologous recombination between the endogenous gene and an 
exogenous DNA molecule introduced into a cell of the animal, e.g., an embryonic 
5 cell of the animal, prior to development of the animal. 

A transgenic animal of the invention can be created by introducing 
CARD-3, CARD-4. CARD-5, or CARD-6-encoding nucleic acid into the male 
pronuclei of a fertilized oocyte, e.g., by microinjection, retroviral infection, and 
allowing the oocyte to develop in a pseudopregnant female foster animal. The 

1C CARD-3. CARD-4. CARD-5. or CARD-6 cDNA sequence, e.g.. that of SEQ ID 
NO: 1 . SEQ ID NO:3. SEQ ID NO:7. SEQ ID NO:9, SEQ ID:25. SEQ ID NO:27. 
SEQ ID NO:38, SEQ ID NO:40. SEQ ID NO:42. SEQ ID NO:48, SEQ ID 
NO:50, SEQ ID NO:51. SEQ ID NO:53. SEQ ID NO:54, SEQ ID NO:56. SEQ 
ID NO:60. SEQ ID NO:62. or the cDNA of ATCC 203037. or the cDNA of 

15 ATCC 203035. or the cDNA of ATCC 203036. or the cDNA of ATCC PTA-21 L 
the cDNA of ATCC PTA-21 2, or the cDNA of ATCC PTA-21 3) can be 
introduced as a transgene into the genome of a non-human animal. Alternatively, 
a nonhuman homolog or ortholog of the human CARD-3. CARD-4. CARD-5. or 
CARD-6 gene, such as a mouse CARD-3. CARD-4. CARD-5. or CARD-6 gene. 

2 0 can be isolated based on hybridization to the human CARD-3. CARD-4. CARD- 
5. or CARD-6 cDNA and used as a transgene. For example, the mouse ortholog 
of CARD-4. Figure 15 and SEQ ID NO:42 can be used to make a transgenic 
animal using standard methods, lntronic sequences and polyadenylation signals 
can also be included in the transgene to increase the efficiency of expression of 

2 5 the transgene. A tissue-specific regulatory sequence(s) can be operably linked to 

the CARD-3. CARD-4. CARD-5. or CARD-6 transgene to direct expression of 
CARD-3. CARD-4. CARD-5. or CARD-6 protein to particular cells. Methods 
for generating transgenic animals via embryo manipulation and microinjection, 
particularly animals such as mice, have become conventional in the art and are 

3 0 described, for example, in U.S. Patent Nos. 4.736.866 and 4.870.009. U.S. Patent 
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No. 4.873,191 and in Hogan, Manipulating the Mouse Embryo, (Cold Spring 
Harbor Laboratory Press, Cold Spring Harbor, N.Y., 1986). Similar methods are 
used for production of other transgenic animals. A transgenic founder animal can 
be identified based upon the presence of the CARD-3. CARD-4, CARD-5, or 
5 CARD-6 transgene in its genome and/or expression of CARD-3. CARD-4. 
CARD-5, or CARD-6 mRNA in tissues or cells of the animals. A transgenic 
founder animal can then be used to breed additional animals carrying the 
transgene. Moreover, transgenic animals carrying a transgene encoding CARD- 
S' CARD-4, CARD-5, or CARD-6 can further be bred to other transgenic animals 

1 0 carrying other transgenes. 

To create an homologous recombinant animal, a vector is prepared 
which contains at least a portion of a CARD-3, CARD-4, CARD-5. or CARD-6 
gene (e.g.. a human or a non-human homolog of the CARD-3, CARD-4, CARD- 
S' or CARD-6 gene, e.g., a murine CARD-3, CARD-4, CARD-5, or CARD-6 

15 gene) into which a deletion, addition or substitution has been introduced to 
thereby alter, e.g., functionally disrupt, the CARD-3, CARD-4. CARD-5. or 
CARD-6 gene. In an embodiment, the vector is designed such that, upon 
homologous recombination, the endogenous CARD-3, CARD-4, CARD-5. or 
CARD-6 gene is functionally disrupted (i.e.. no longer encodes a functional 

2 0 protein; also referred to as a "knock out" vector). Alternatively, the vector can be 
designed such that, upon homologous recombination, the endogenous CARD-3. 
CARD-4, CARD-5, or CARD-6 gene is mutated or otherwise altered but still 
encodes functional protein (e.g., the upstream regulatory region can be altered to 
thereby alter the expression of the endogenous CARD-3, CARD-4. CARD-5, or 

2 5 CARD-6 protein). In the homologous recombination vector, the altered portion of 

the CARD-3, CARD-4. CARD-5. or CARD-6 gene is flanked at its 5' and 3* ends 
by additional nucleic acid of the CARD-3. CARD-4. CARD-5. or CARD-6 gene 
to allow for homologous recombination to occur between the exogenous CARD- 
3. CARD-4. CARD-5. or CARD-6 gene carried by the vector and an endogenous 

3 0 CARD-3. CARD-4. CARD-5. or CARD-6 gene in an embryonic stem cell. The 
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additional flanking CARD-3 v CARD-4, CARD-5, or CARD-6 nucleic acid is of 
sufficient length for successful homologous recombination with the endogenous 
gene. Typically, several kilobases of flanking DNA (both at the 5' and 3' ends) 
are included in the vector (see, e.g., Thomas and Capecchi (1987) Cell 51:503 for 
5 a description of homologous recombination vectors). The vector is introduced 
into an embryonic stem cell line (e.g., by electroporation) and cells in which the 
introduced CARD-3, CARD-4, CARD-5, or CARD-6 gene has homologously 
recombined with the endogenous CARD-3. CARD-4, CARD-5, or CARD-6 gene 
are selected (see, e.g.. Li et al. (1992) Cell 69:915). The selected cells are then 

1 0 injected into a blastocyst of an animal (e.g., a mouse) to form aggregation 

chimeras (see. e.g.. Bradley in Teratocarcinomas and Embryonic Stem Cells: A 
Practical Approach. Robertson, ed. (IRL. Oxford, 1987) pp. 1 13-152). A 
chimeric embryo can then be implanted into a suitable pseudopregnant female 
foster animal and the embryo brought to term. Progeny harboring the 

15 homologously recombined DNA in their germ cells can be used to breed animals 
in which all cells of the animal contain the homologously recombined DNA by 
germline transmission of the transgene. Methods for constructing homologous 
recombination vectors and homologous recombinant animals are described 
further in Bradley (1991 ) Current Opinion in Bio/Technology 2:823-829 and in 

2 0 PCT Publication Nos. WO 90/1 1354. WO 91/01 140, WO 92/0968, and WO 
93/04169. 

In another embodiment, transgenic non-humans animals can be 
produced which contain selected systems which allow for regulated expression of 
the transgene. One example of such a system is the cre/loxP recombinase system 

2 5 of bacteriophage PI . For a description of the cre/loxP recombinase system, see. 
e.g.. Lakso et al. (1992) Proc. Natl. Acad. Sci. USA 89:6232-6236. Another 
example of a recombinase system is the FLP recombinase system of 
Saccharomyces cerevisiae (O'Gorman et al. ( 1 991 ) Science 25 1 : 1 35 1 -1 355. If a 
cre/loxP recombinase system is used to regulate expression of the transgene. 

2 0 animals containing transgenes encoding both the Cre recombinase and a selected 
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protein are required. Such animals can be provided through the construction of 
"double" transgenic animals, e.g., by mating two transgenic animals, one 
containing a transgene encoding a selected protein and the other containing a 
transgene encoding a recornbinase. 
5 Clones of the non-human transgenic animals described herein can also 

be produced according to the methods described in Wilmut et al. (1997) Nature 
385:810-813 and PCT Publication Nos. WO 97/07668 and WO 97/07669. In 
brief, a cell, e.g., a somatic cell, from the transgenic animal can be isolated and 
induced to exit the growth cycle and enter Go phase. The quiescent cell can then 

10 be fused, e.g., through the use of electrical pulses, to an enucleated oocyte from 
an animal of the same species from which the quiescent cell is isolated. The 
reconstructed oocyte is then cultured such that it develops to morula or blastocyte 
and then transferred to pseudopregnant female foster animal. The offspring 
borne of this female foster animal will be a clone of the animal from which the 

15 cell, e.g., the somatic cell, is isolated. 

IV. Pharmaceutical Compositions 

The CARD-3. CARD-4. CARD-5, or CARD-6 nucleic acid 
molecules, CARD-3, CARD-4. CARD-5, or CARD-6 proteins, and anti-CARD- 
2 0 3. CARD-4. CARD-5, or CARD-6 antibodies (also referred to herein as "active 
compounds") of the invention can be incorporated into pharmaceutical 
compositions suitable for administration. Such compositions typically comprise 
the nucleic acid molecule, protein, or antibody and a pharmaceutically acceptable 
carrier. As used herein the language "pharmaceutically acceptable carrier" is 

2 5 intended to include any and all solvents, dispersion media, coatings, antibacterial 

and antifungal agents, isotonic and absorption delaying agents, and the like, 
compatible with pharmaceutical administration. The use of such media and 
agents for pharmaceutically active substances is well known in the an. Except 
insofar as any conventional media or agent is incompatible with the active 

3 0 compound, use thereof in the compositions is contemplated. Supplementary 
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active compounds can also be incorporated into the compositions. 

The invention includes methods for preparing pharmaceutical 
compositions for modulating the expression or activity of a polypeptide or 
nucleic acid of the invention. Such methods comprise formulating a 
5 pharmaceutical^ acceptable carrier with an agent which modulates expression or 
activity of a polypeptide or nucleic acid of the invention. Such compositions can 
further include additional active agents. Thus, the invention further includes 
methods for preparing a pharmaceutical composition by formulating a 
pharmaceutically acceptable carrier with an agent which modulates expression or 

1 0 activity of a polypeptide or nucleic acid of the invention and one or more 

addtional active compounds. 

The agent which modulates expression or activity may, for example, 
be a small molecule. For example, such small molecules include peptides, 
peptidomimetics, amino acids, amino acid analogs, polynucleotides, 
15 polynucleotide analogs, nucleotides, nucleotide analogs, organic or inorganic 

compounds (i.e., including heteroorganic and organometallic compounds) having 
a molecular weight less than about 10.000 grams per mole, organic or inorganic 
compounds having a molecular weight les than about 5.000 grams per mole, 
organic or inorganic compounds having a molecular weight less than about 1 .000 

2 0 grams per mole, organic or inorganic compounds having a molecular weight less 

than about 500 grams per mole, and salts, esters, and other pharmaceutically 
acceptable forms of such compounds. It is understood that appropriate doses of 
small molecule agents depends upon a number of factors within the ken of the 
ordinarily skilled physician, veterinarian, or researcher. The dose(s) of the small 

2 5 molecule will van', for example, depending upon the identity, size, and condition 

of the subject or sample being treated, further depending upon the route by which 
the composition is to be administered, if applicable, and the effect which the 
practitioner desires the small molecule to have upon the nucleic acid or 
polypeptide of the invention. Exemplary doses include milligram or microgram 

3 0 amounts of the small molecule per kilogram of subject or sample weight (e.g.. 
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about 1 microgram per kilogram to about 500 milligrams per kilogram, about 100 
micrograms per kilogram to about 5 milligrams per kilogram, or about 1 
microgram per kilogram to about 50 micrograms per kilogram. It is furthermore 
understood that appropriate doses of a small molecule depend upon the potency 
5 of the small molecule with respect to the expression or activity to modulated. 
Such appropriate doses may be determined using the assays described herein. 
When one or more of these small molecules is to be administered to an animal 
(e.g.. a human) in order to modulate expression or activity of a polypeptide or 
nucleic acid of the invention, a physician, veterinarian, or researcher may, for 

1 0 example, prescribe a relatively low dose at first, subsequently increasing the dose 
until an appropriate response is obtained. In addition, it is understood that the 
specific dose level for any particular subject will depend upon a variety of factors 
including the activity of the specific compound employed, the age, body weight, 
general health, gender, and diet of the subject, the time of administration, the 

15 route of administration, the rate of excretion, any drug combination, and the 
degree of expression or activity to be modulated. 

A pharmaceutical composition of the invention is formulated to be 
compatible with its intended route of administration. Examples of routes of 
administration include parenteral, e.g.. intravenous, intradermal, subcutaneous. 

2 0 oral (e.g.. inhalation), transdermal (topical), transmucosal. and rectal 

administration. Solutions or suspensions used for parenteral, intradermal, or 
subcutaneous application can include the following components: a sterile diluent 
such as water for injection, saline solution, fixed oils, polyethylene glycols, 
glycerine, propylene glycol or other synthetic solvents; antibacterial agents such 

2 5 as benzyl alcohol or methyl parabens: antioxidants such as ascorbic acid or , 

sodium bisulfite; chelating agents such as ethylenediaminetetraacetic acid; 
buffers such as acetates, citrates or phosphates and agents for the adjustment of 
tonicity such as sodium chloride or dextrose. pH can be adjusted with acids or 
bases, such as hydrochloric acid or sodium hydroxide. The parenteral preparation 

3 0 can be enclosed in ampoules, disposable syringes or multiple dose vials made of 
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glass or plastic. 

Pharmaceutical compositions suitable for injectable use include sterile 
aqueous solutions ( where water soluble) or dispersions and sterile powders for 
the extemporaneous preparation of sterile injectable solutions or dispersion. For 
5 intravenous administration, suitable carriers include physiological saline, 
bacteriostatic water. Cremophor EL? (BASF; Parsippany, NJ) or phosphate 
buffered saline (PBS). In all cases, the composition must be sterile and should be 
fluid to the extent that easy syringability exists. It must be stable under the 
conditions of manufacture and storage and must be preserved against the 

10 contaminating action of microorganisms such as bacteria and fungi. The carrier 
can be a solvent or dispersion medium containing, for example, water, ethanoL 
polyol (for example, glycerol, propylene glycol, and liquid polyetheylene glycol, 
and the like), and suitable mixtures thereof. The proper fluidity can be 
maintained, for example, by the use of a coating such as lecithin, by the 

15 maintenance of the required particle size in the case of dispersion and by the use 
of surfactants. Prevention of the action of microorganisms can be achieved by 
various antibacterial and antifungal agents, for example, parabens. chlorobutanol. 
phenol, ascorbic acid, thimerosal. and the like. In many cases, it will be 
preferable to include isotonic agents, for example, sugars, polyalcohols such as 

2 0 inannitol. sorbitol, sodium chloride in the composition. Prolonged absorption of 
the injectable compositions can be brought about by including in the composition 
an agent which delays absorption, for example, aluminum monostearate and 
gelatin. 

Sterile injectable solutions can be prepared by incorporating the active 

2 5 compound (e.g.. a CARD-3, CARD-4, CARD-5. or CARD-6 protein or 

anti-CARD-3. CARD-4. CARD-5, or CARD-6 antibody) in the required amount 
in an appropriate solvent with one or a combination of ingredients enumerated 
above, as required, followed by filtered sterilization. Generally, dispersions are 
prepared by incorporating the active compound into a sterile vehicle which 

3 0 contains a basic dispersion medium and the required other ingredients from those 
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enumerated above. In the case of sterile powders for the preparation of sterile 
injectable solutions, the preferred methods of preparation are vacuum drying and 
freeze-drying which yields a powder of the active ingredient plus any additional 
desired ingredient from a previously sterile-filtered solution thereof. 

5 Oral compositions generally include an inert diluent or an edible 

currier. They can be enclosed in gelatin capsules or compressed into tablets. For 
the purpose of oral therapeutic administration, the active compound can be 
incorporated with excipients and used in the form of tablets, troches, or capsules. 
Oral compositions can also be prepared using a fluid carrier for use as a 

1 0 mouthwash, wherein the compound in the fluid carrier is applied orally and 
swished and expectorated or swallowed. Pharmaceutically compatible binding 
agents, and/or adjuvant materials can be included as part of the composition. The 
tablets, pills, capsules, troches and the like can contain any of the following 
ingredients, or compounds of a similar nature: a binder such as microcrystalline 

1 5 cellulose, gum tragacanth or gelatin; an excipient such as starch or lactose, a 
disintegrating agent such as alginic acid, Primogel, or corn starch; a lubricant 
such as magnesium stearate or Sterotes; a glidant such as colloidal silicon 
dioxide; a sweetening agent such as sucrose or saccharin; or a flavoring agent 
such as peppermint, methyl salicylate, or orange flavoring. For administration by 

2 0 inhalation, the compounds are delivered in the form of an aerosol spray from 
pressured container or dispenser which contains a suitable propellant. e.g.. a gas 
such as carbon dioxide, or a nebulizer. 

Systemic administration can also be by transmucosal or transdermal 
means. For transmucosal or transdermal administration, penetrants appropriate to 

2 5 the barrier to be permeated are used in the formulation. Such penetrants are 

generally known in the art, and include, for example, for transmucosal 
administration, detergents, bile salts, and fusidic acid derivatives. Transmucosal 
administration can be accomplished through the use of nasal sprays or 
suppositories. For transdermal administration, the active compounds are 

3 0 formulated into ointments, salves, gels, or creams as generally known in the an. 
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The compounds can also be prepared in the form of suppositories 
(e.g.. with conventional suppository bases such as cocoa butter and other 
glycerides) or retention enemas for rectal delivery. 

In one embodiment, the active compounds are prepared with carriers 
5 that will protect the compound against rapid elimination from the body, such as a 
controlled release formulation, including implants and microencapsulated 
deliver}- systems. Biodegradable, biocompatible polymers can be used, such as 
ethylene vinyl acetate, polyanhydrides, polyglycolic acid, collagen, 
polyorthoesters, and polylactic acid. Methods for preparation of such 
1 0 formulations will be apparent to those skilled in the art. The materials can also 
be obtained commercially from Alza Corporation and Nova Pharmaceuticals, Inc. 
Liposomal suspensions (including liposomes targeted to infected cells with 
monoclonal antibodies to viral antigens) can also be used as pharmaceutically 
acceptable carriers. These can be prepared according to methods known to those 

1 5 skilled in the art, for example, as described in U.S. Patent No. 4.522,81 1. 

It is especially advantageous to formulate oral or parenteral 
compositions in dosage unit form for ease of administration and uniformity of 
dosage. Dosage unit form as used herein refers to physically discrete units suited 
as unitary dosages for the subject to be treated; each unit containing a 

2 C predetermined quantity of active compound calculated to produce the desired 

therapeutic effect in association with the required pharmaceutical carrier. The 
specification for the dosage unit forms of the invention are dictated by and 
directly dependent on the unique characteristics of the active compound and the 
particular therapeutic effect to be achieved, and the limitations inherent in the art 

2 5 of compounding such an active compound for the treatment of individuals. 

For antibodies, the preferred dosage is 0.1 mg/kg to- 100 mg/kg of 
body weight (generally 1 0 mg/kg to 20 mg/kg). If the antibody is to act in the 
brain, a dosage of 50 mg/kg to 100 mg/kg is usually appropriate. Generally, 
partially human antibodies and fully human antibodies have a longer half-life 

3 0 within the human body than other antibodies. Accordingly, lower dosages and 
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less frequent administration is often possible. Modifications such as lipidation 
can be used to stabilize antibodies and to enhance uptake and tissue penetration 
(e.g., into the brain). A method for lipidation of antibodies is described by 
Cruikshank et al. ((1997) J. Acquired Immune Deficiency Syndromes and Human 
5 Retrovirology 14:193). 

The nucleic acid molecules of the invention can be inserted into 
vectors and used as gene therapy vectors. Gene therapy vectors can be delivered 
lo a subject by. for example, intravenous injection, local administration (U.S. 
Patent 5.328.470) or by stereotactic injection (see, e.g.. Chen et al. (1994) Proc. 
10 Natl. Acad. Sci. USA 91 :3054-3057). The pharmaceutical preparation of the 
gene therapy vector can include the gene therapy vector in an acceptable diluent, 
or can comprise a slow release matrix in which the gene delivery vehicle is 
imbedded. Alternatively, where the complete gene delivery vector can be 
produced intact from recombinant cells, e.g. retroviral vectors, the 
1 5 pharmaceutical preparation can include one or more cells which produce the gene 
deliver)' system. 

The pharmaceutical compositions can be included in a container, 
pack, or dispenser together with instructions for administration. 

2 0 V. Uses and Methods of the Invention 

The nucleic acid molecules, proteins, protein homologues. and 
antibodies described herein can be used in one or more of the following methods: 
a) screening assays: b) detection assays (e.g., chromosomal mapping, tissue 
tvping. forensic biology), c) predictive medicine (e.g.. diagnostic assays. 

2 5 prognostic assays, monitoring clinical trials, and pharmacogenomics); and d) 

methods of treatment (e.g., therapeutic and prophylactic). A CARD-3. CARD-4, 
CARD-5. or CARD-6 protein interacts with other cellular proteins and can thus 
be used for (i) regulation of cellular proliferation; (ii ) regulation of cellular 
differentiation: and (iii) regulation of cell survival. The isolated nucleic acid 

3 0 molecules ol* the invention can be used to express CARD-3. CARD-4. CARDo. 
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or CARD-6 protein (e.g., via a recombinant expression vector in a host cell in 
gene therapy applications), to detect CARD-3, CARD-4, CARD-5. or CARD-6 
mRNA (e.g., in a biological sample) or a genetic lesion in a CARD-3. CARD-4. 
CARD-5, or CARD-6 gene, and to modulate CARD-3. CARD-4. CARD-5, or 
5 CARD-6 activity. In addition, the CARD-3, CARD-4, CARD-5, or CARD-6 
proteins can be used to screen drugs or compounds which modulate the CARD-3. 
CARD-4, CARD-5, or CARD-6 activity or expression as well as to treat 
disorders characterized by insufficient or excessive production of CARD-3, 
CARD-4. CARD-5. or CARD-6 protein or production of CARD-3. CARD-4. 

1 0 CARD-5. or CARD-6 protein forms which have decreased or aberrant activity 

compared to CARD-3. CARD-4, CARD-5. or CARD-6 wild type protein. " In 
addition, the anti-CARD-3, CARD-4. CARD-5, or CARD-6 antibodies of the 
invention can be used to detect and isolate CARD-3. CARD-4, CARD-5. or 
CARD-6 proteins and modulate CARD-3. CARD-4, CARD-5, or CARD-6 
1 5 activity. 

This invention further pertains to novel agents identified by the above- 
described screening assays and uses thereof for treatments as described herein. 

A. Screening Assays 

2 0 The invention provides a method (also referred to herein as a 

"screening assay") for identifying modulators, i.e., candidate or test compounds 
or agents (e.g.. peptides, peptidomimetics. small molecules or other drugs) which 
bind to CARD-3. CARD-4. CARD-5. or CARD-6 proteins or biologically active 
portions thereof or have a stimulatory or inhibitory effect on. for example. 

2 5 CARD-3. CARD-4, CARD-5. or CARD-6 expression or CARD-3, CARD-4. 

CARD-5, or CARD-6 activity. An example of a biologically active portion of 
human CARD-4 is amino acids 1-145 encoding the CARD domain which is 
sufficient to exhibit CARD-3-binding activity as described in Example 7. Amino 
acids 406-953 of human CARD-4L comprising the leucine rich repeat domain 

3 0 represent a biologically active portion of CARD-4L because they possess 
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hNUDC-binding activity as described in Example 8. An example of a 
biologically active portion of human CARD-5 is amino acids 1 1 1-881 (SEQ ID 
NO:58) encoding the CARD domain. 

Among the screening assays provided by the invention are screening 
5 to identify molecules thai prevent the dimerization of a CARD-containing 
polypeptide of the invention, screening to identify molecules which block the 
binding of a CARD containing polypeptide to a CARD-containing polypeptide of 
the invention (e.g., CARD-4), screening to identify a competitive inhibitor of the 
binding of a nucleotide to the nucleotide binding site of a CARD-containing 

10 polypeptide of the invention, e.g., human CARD-4L. screening to identify 
compounds which block the interaction between the leucine-rich repeat of a 
CARD-containing polypeptide of the invention and a ligand which binds to the 
lcucine-rich repeat. 

For CARD-6 screening assays can be used to identify' molecules 

15 which modulate a CARD-6 mediated increase in transcription of genes having an 
AP-1 OR nf-KB binding site. For example, expression of a reporter under the 
control of NF-kB (or AP-1) is measured in the presence and absence of a 
candidate molecule and in the presence and absence of CARD-6 to identify those 
molecules which alter expression of the reporter in a CARD-6 dependent manner. 

2 0 In addition, screening assays can be used to identify molecules which modulate a 
CARD-6 mediated increase in CHOP phosphorylation. For example, the 
expression of a reporter gene under the control of CHOP is measured in the 
presence and absence of a candidate small molecule and in the presence and 
absence of CARD-6 to identify' those molecules which alter expression of the 

2 5 reporter in a CARD-6 dependent manner. A screening assay can be carried out to 

identify molecules which modulate the CARD-6 mediated increase in CHOP 
phosphorylation. For example, CHOP phosphorylation is measured in the 
presence and absence of a candidate molecule and in the presence and absence of 
CARD-6. Phosphorylation of CHOP can he measured using an antibody which 

3 0 binds to phosphoryiated CHOP, but not to non-phosphoryiated CHOP. 
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In one embodiment, the invention provides assays for screening 
candidate or test compounds which bind to or modulate the activity of a CARD- 
3, CARD-4. CARD-5, or CARD-6 proteins or polypeptides or biologically active 
portions thereof. The test compounds of the present invention can be obtained 
5 using any of the numerous approaches in combinatorial library methods known in 
the art. including: biological libraries; spatially addressable parallel solid phase or 
solution phase libraries; synthetic library methods requiring deconvolution; the 
"one-bead one-compound" library method; and synthetic library methods using 
affinity chromatography selection. The biological library approach is limited to 

10 peptide libraries, while the other four approaches are applicable to peptide, non- 
peptide oligomer or small molecule libraries of compounds (Lam (1997) r 
Anticancer Drug Des. 12: 145). Examples of methods for the synthesis of 
molecular libraries can be found in the art, for example in: DeWitt et al. (1993) 
Proc. Natl. Acad. Sci. U.S.A. 90:6909; Erb et al. ( 1 994) Proc. Natl. Acad. Sci. 

15 USA 91:1 1422; Zuckermann et al. (1994). J. Med. Chem. 37:2678; Cho et al. 
(1993) Science 261:1303; Carrell et al. (1994) Angew. Chem. Int. Ed. Engl. 
33:2059: Carell et al. (1994) Angew. Chem. Int. Ed. Engl. 33:2061; and Gallop et 
al. (1994) J. Med. Chem. 37:1233. 

Libraries of compounds may be presented in solution (e.g.. Houghten 

2 0 (1992) Bio/Techniques 13:412-421). or on beads (Lam (1991) Nature 354:82-84). 
chips (Fodor (1993) Nature 364:555-556). bacteria (U.S. Patent No. 5.223.409). 
spores (Patent Nos. 5.571.698: 5.403.484: and 5.223.409), plasmids (Cull et al. 
(1992) Proc, Natl. Acad. Sci. USA 89: 1 865-1 869) or on phage (Scott and Smith 
(1990) Science 249:386-390; Devlin (1990) Science 249:404-406; Cwirla et al. 

2 5 (1990) Proc. Natl. Acad. Sci. 87:6378-6382: and Felici ( 1991 ) J. Mol. Biol. 

222:301-310). 

Determining the ability of the test compound to modulate the activity 
of CARD-3. CARD-4. CARD-5. or CARD-6 or a biologically active portion 
thereof can be accomplished, for example, by determining the ability of the 

3 0 CARD-3. CARD-4. CARD-5. or CARD-6 protein to bind to or interact with a 
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CARD-3, CARD-4, CARD-5, or CARD-6 target molecule. As used herein, a 
"target molecule" is a molecule with which a CARD-3, CARD-4, CARD-5, or 
CARD-6 protein binds or interacts in nature, for example, a molecule associated 
with the internal surface of a cell membrane or a cytoplasmic molecule. A 
5 CARD-3. CARD-4, CARD-5, or CARD-6 target molecule can be a non-CARD- 
3, CARD-4, CARD-5, or CARD-6 molecule or a CARD-3, CARD-4, CARD-5. 
or CARD-6 protein or polypeptide of the present invention. In one embodiment, 
a CARD-3, CARD-4. CARD-5, or CARD-6 target molecule is a component of an 
apoptotic signal transduction pathway, e.g., CARD-3 and CARD-4. The target, 

10 for example, can be a second intracellular protein which has catalytic activity or a 
protein which facilitates the association of downstream signaling molecules with 
CARD-3, CARD-4. CARD-5, or CARD-6. In another embodiment, CARD-3, 
CARD-4, CARD-5, or CARD-6 target molecules include CARD-3 because 
CARD-3 was found to bind to CARD-4 (Examples 7 and 12) and hNUDC 

1 5 because hNUDC was found to bind to CARD-4 (Example 8). 

Determining the ability of the test compound to modulate the activity 
of CARD-3. CARD-4. CARD-5. or CARD-6 or a biologically active portion 
thereof can be accomplished, for example, by determining the ability of the 
CARD-3. CARD-4. CARD-5, or CARD-6 protein to bind to or interact with any 

2 0 of the specific proteins listed in the previous paragraph as CARD-3. CARD-4. 
CARD-5. or CARD-6 target molecules. In another embodiment. CARD-3. 
CARD-4, CARD-5. or CARD-6 target molecules include all proteins that bind to 
a CARD-3, CARD-4. CARD-5, or CARD-6 protein or a fragment thereof in a 
two-hybrid system binding assay which can be used without undue 

2 5 experimentation to isolate such proteins from cDNA or genomic two-hybrid 

system libraries. For example. Example 7 describes the use of the CARD-4 
CARD domain region to identify CARD-3 in a two-hybrid screen and Example 8 
describes the use of the CARD-4 leucine rich repeat domain region to identify 
hNUDC in a two-hybrid screen. The binding assays described in this section can 

3 0 be cell-based or cell free (described subsequently). 
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Determining the ability of the CARD-3, CARD-4, CARD-5. or 
CARD-6 protein to bind to or interact with a CARD-3, CARD-4, CARD-5, or 
CARD-6 target molecule can be accomplished by one of the methods described 
above for determining direct binding. In an embodiment, determining the ability 
5 of the CARD-3, CARD-4, CARD-5. or CARD-6 protein to bind to or interact 
with a CARD-3, CARD-4. CARD-5. or CARD-6 target molecule can be 
accomplished by determining the activity of the target molecule. For example, 
the activity of the target molecule can be determined by detecting induction of a 
cellular second messenger of the target (e.g., intracellular Ca2+. diacylglycerol. 

10 IP3. etc.), detecting catalytrc/enzymatic activity of the target on an appropriate 
substrate, detecting the induction of a reporter gene (e.g.. a CARD-3. CARD-4. . 
CARD-5. or CARD-6-responsive regulatory element operativeiy linked to a 
nucleic acid encoding a detectable marker, e.g. luciferase). or detecting a cellular 
response, for example, cell survival, cellular differentiation, or cell proliferation. 

15 For example, in Example 12 CARD-4 is shown to bind to CARD-3 and in 
Example 10. by monitoring a cellular response. CARD-4 is shown to enhance 
caspase 9 activity, cell death or apoptosis. Because CARD-3 and CARD-4 
enhance caspase 9 activity, activity can be monitored by assaying the caspase 9- 
mediated apoptosis cellular response or caspase 9 enzymatic activity. In addition. 

2 0 and in another embodiment, genes induced by CARD-3. CARD-4, CARD-5. or 
CARD-6 expression can be identified by expressing CARD-3. CARD-4. CARD- 
5. or CARD-6 in a cell line and conducting a transcriptional profiling experiment 
wherein the mRNA expression patterns of the cell line transformed with an 
empty expression vector and the cell line transformed with a CARD-3. CARD-4. 

2 5 CARD-5. or CARD-6 expression vector are compared. The promoters of genes 

induced by CARD-3. CARD-4, CARD-5. or CARD-6 expression can be 
operativeiy linked to reporter genes suitable for screening such as luciferase. 
secreted alkaline phosphatase, or beta-galactosidase and the resulting constructs 
could be introduced into appropriate expression vectors. A recombinant cell line 

3 0 containing CARD-3. CARD-4. CARD-5, or CARD-6 and transfected .with an 
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expression vector containing a CARD-3, CARD-4, CARD-5, or CARD-6 
responsive promoter operatively linked to a reporter gene can be used to identify 
test compounds that modulate CARD-3, CARD-4, CARD-5, or CARD-6 activity 
by assaying the expression of the reporter gene in response to contacting the 
5 recombinant cell line with test compounds. CARD-3, CARD-4, CARD-5, or 
CARD-6 agonists can be identified as increasing the expression of the reporter 
gene and CARD-3, CARD-4, CARD-5, or CARD-6 antagonists can be identified 
as decreasing the expression of the reporter gene. 

In another embodiment of the invention, the ability of a test 

1 0 compound to modulate the activity of CARD-3, CARD-4, or biologically active 
portions thereof can be determined by assaying the ability of the test compound 
to modulate CARD-3. CARD-4. CARD-5, or CARD-6-dependent pathways or 
processes where the CARD-3, CARD-4, CARD-5, or CARD-6 target proteins 
that mediate the CARD-3, CARD-4, CARD-5, or CARD-6 effect are known or 

1 5 unknown. Potential CARD-3, CARD-4, CARD-5, or CARD-6-dependent 

pathways or processes include, but are not limited to. the modulation of cellular 
signal transduction pathways and their related second messenger molecules (e.g., 
intracellular Ca :+ , diacylglycerol, IP3. cAMP etc.), cellular enzymatic activities, 
cellular responses (e.g.. cell survival, cellular differentiation, or cell 

2 0 proliferation), or the induction or repression of cellular or heterologous mRNAs 
or proteins. CARD-3. CARD-4, CARD-5, or CARD-6-dependent pathways or 
processes could be assayed by standard cell-based or cell free assays appropriate 
for the specific pathway or process under study. For example. Example 9 
describes how expression of CARD-4S or CARD-4L in 293T cells induces the 

2 5 NF-kB pathway as determined by the measurement of a cotransfected NF-kB 

pathway luciferasc reporter gene. In another embodiment, cells cotransfected 
with CARD-4 and the NF-kB luciferase reporter gene could be contacted with a 
test compound and test compounds that block CARD-4 activity could be 
identified by their reduction of CARD-4-dependeni NF-kB pathway luciferase 

3 0 reporter gene expression. Test compounds that agonize CARD-4 would be 
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expected to increase reporter gene expression, in another embodiment, CARD-4 
could be expressed in a cell line and the recombinant CARD-4-expressing cell 
line could be contacted with a test compound. Test compounds that inhibit 
CARD-4 activity could be indentified by their reduction of CARD-4-depended 
5 NF-kB pathway stimulation as measured by the assay of a NF-kB pathway 

reporter gene. NF-kB nuclear localization, IkB phosphorylation or proteolysis, or 
other standard assays for NF-kB pathway activation known to those skilled in the 
art 

In yet another embodiment, an assay of the present invention is a cell- 
1 C tree assay comprising contacting a CARD-3, CARD-4, CARD-5, or CARD-6 
protein or biologically active portion thereof with a test compound and 
determining the ability of the test compound to bind to the CARD-3. CARD-4, 
CARD-5. or CARD-6 protein or biologically active portion thereof Binding of 
the test compound to the CARD-3, CARD-4. CARD-5. or CARD-6 protein can 
15 be determined either directly or indirectly as described above. In one 

embodiment, a competitive binding assay includes contacting the CARD-3, 
CARD-4. CARD-5. or CARD-6 protein or biologically active portion thereof 
with a compound known to bind CARD-3. CARD-4, CARD-5. or CARD-6 to 
form an assay mixture, contacting the assay mixture with a lest compound, and 
2 0 determining the ability of the lest compound to interact with a CARD-3, CARD- 
4. CARD-5. or CARD-6 protein, wherein determining the ability of the test 
compound to interact with a CARD-3. CARD-4. CARD-5. or CARD-6 protein 
comprises determining the ability of the test compound to preferentially bind to 
CARD-3. CARD-4. CARD-5. or CARD-6 or biologically active portion thereof 

2 5 as compared to the known binding compound. . 

In another embodiment, an assay is a cell-free assay comprising 
contacting CARD-3. CARD-4. CARD-5. or CARD-6 protein or biologically 
active portion thereof with a test compound and determining the ability of the test 
compound to modulate (e.g.. stimulate or inhibit) the activity of the CARD-3. 

3 0 CARD-4. CARD-5. or CARD-6 protein or biologically active portion thereof. 
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Determining the ability of the test compound to modulate the activity of CARD- 

3. CARD-4, CARD-5. or CARD-6 can be accomplished, for example, by 
determining the ability of the CARD-3. CARD-4, CARD-5. or CARD-6 protein 
to bind to a CARD-3, CARD-4. CARD-5. or CARD-6 target molecule by one of 

5 the methods described above for determining direct binding. In an alternative 
embodiment, determining the ability of the test compound to modulate the 
activity of CARD-3. CARD-4. CARD-5. or CARD-6 can be accomplished by 
determining the ability of the C ARD-3. CARD-4, CARD-5, or CARD-6 protein 
to further modulate a CARD-3, CARD-4, CARD-5, or CARD-6 target molecule. 

10 For example, the catalytic/enzymatic activity of the target molecule on an 
appropriate substrate can be determined as previously described. 

In yet another embodiment, the cell-free assay comprises contacting 
the CARD-3, CARD-4. CARD-5, or CARD-6 protein or biologically active 
portion thereof with a known compound which binds CARD-3. CARD-4, 

15 CARD-5. or CARD-6 to form an assay mixture, contacting the assay mixture 
with a test compound, and determining the ability of the test compound to 
interact with a CARD-3. CARD-4. CARD-5. or CARD-6 protein, wherein 
determining the ability of the test compound to interact with a CARD-3. CARD- 

4, CARD-5, or CARD-6 protein comprises determining the ability of the CARD- 

2 0 3, CARD-4, CARD-5. or CARD-6 protein to preferentially bind to or modulate 

the activity of a CARD-3. CARD-4. CARD-5. or CARD-6 target molecule. The 
cell-free assays of the present invention are amenable to use of both the soluble 
form or the membrane-associated form of CARD-3. CARD-4. CARD-5. or 
CARD-6. A membrane-associated form of CARD-3. CARD-4. CARD-5. or 
25 CARD-6 refers to CARD-3. CARD-4. CARD-5. or CARD-6 that interacts with a 
membrane-bound target molecule. In the case of cell-free assays comprising the 
membrane-associated form of CARD-3. CARD-4. CARD-5. or CARD-6. it may 
be desirable to utilize a solubilizing agent such that the membrane-associated 
form of CARD-3. CARD-4. CARD-5. or CARD-6 is maintained in solution. 

3 0 Examples of such solubilizing agents include non-ionic detergents such as n- 
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octylglucoside, n-dodecylglucoside, n-dodecylmaltoside, octanoyl-N- 
methylglucamide, decanoyl-N-methylglucamide, Triton® X-100, Triton® X-l 14, 
Thesit®, Isotridecypoiy( ethylene glycol ether)n, 3-[(3- 
cholamidopropyl)dimethylamminio]-l -propane sulfonate (CHAPS), 3-[(3- 
5 cholamidopropyI)dimethylamminio]-2-hydroxy- ] -propane sulfonate (CHAPSO), 
or N-dodecyl=N.N-dimethyl-3-ammonio-l -propane sulfonate. 

In more than one embodiment of the above assay methods of the 
present invention, it may be desirable to immobilize either CARD-3, CARD-4. 
CARD-5. or CARD-6 or its target molecule to facilitate separation of complexed 

1 0 from uncomplexed forms of one or both of the proteins, as well as to 

accommodate automation of the assay. Binding of a test compound to CARD-3, 
CARD-4. CARD-5. or CARD-6, or interaction of CARD-3, CARD-4. CARD-5. 
or CARD-6 with a target molecule in the presence and absence of a candidate 
compound, can be accomplished in any vessel suitable for containing the 
15 reac.tants. Examples of such vessels include microtitre plates, test tubes, and 
micro-centrifuge tubes. In one embodiment, a fusion protein can be provided 
which adds a domain that allows one or both of the proteins to be bound to a 
matrix. For example. glutathione-S-transferase/CARD-3. CARD-4, CARD-5. or 
CARD-6 fusion proteins or glutathionc-S-transferase/target fusion proteins can 

2 0 be adsorbed onto glutathione sepharose beads (Sigma Chemical; St. Louis. MO) 

or glutathione derivatized microtitre plates, which are then combined with the test 
compound or the test compound and either the non-adsorbed target protein or 
CARD-3. CARD-4. CARD-5. or CARD-6 protein, and the mixture incubated 
under conditions conducive to complex formation (e.g.. at physiological 

2 5 conditions for salt and pH). Following incubation, the beads or microtitre plate 

wells are washed to remove any unbound components, the matrix immobilized in 
the case of beads, complex determined either directly or indirectly, for example, 
as described above. Alternatively, the complexes can be dissociated from the 
matrix, and the level of CARD-3. CARD-4. CARD-5. or CARD-6 binding or 

3 0 activity determined using standard techniques. In an alternative embodiment. 
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MYC or HA epitope tag CARD-3 or CARD-4 fusion proteins or MYC or HA 
epitope tag target fusion proteins can be adsorbed onto anti-MYC or anti-HA 
antibody coated microbeads or onto anti-MYC or anti-HA antibody coated 
microtitre plates, which are then combined with the test compound or the test 
5 compound and either the non-adsorbed target protein or CARD-3 or CARD-4 
protein, and the mixture incubated under conditions conducive to complex 
formation (e.g., at physiological conditions for salt and pH). Following 
incubation, the beads or microtitre plate wells are washed to remove any unbound 
components, the matrix immobilized in the case of beads, complex determined 

1 0 cither directly or indirectly, for example, as described above. Alternatively, the 

complexes can be dissociated from the matrix, and the level of CARD-3 or 
CARD-4 binding or activity determined using standard techniques. Example 12 
describes an HA epitope tagged CARD-4 protein that physically interacts in a 
. coimmunoprccipitation assay with MYC epitope tagged CARD-3. In an 
1 5 embodiment of the invention. HA epitope tagged CARD-4 could be used in 

combination with MYC epitope CARD-3 in the son of protein-protein interaction 
assay described earlier in this paragraph. 

Other techniques for immobilizing proteins on matrices can also be 
used in the screening assays of the invention. For example. CARD-3. CARD-4. 

2 0 CARD-5. or CARD-6 or its target molecule can be immobilized utilizing 

conjugation of biotin and streptavidin. Biotinylated CARD-3 or CARD-4 or 
target molecules can be prepared from biotin-NHS (N-hydroxy-succinimide) 
using techniques well known in the art (e.g.. biotinylation kit. Pierce Chemicals; 
Rockford. IL). and immobilized in the wells of streptavidin-coated % well plates 
25 (Pierce Chemical). Alternatively, antibodies reactive with CARD-3. CARD-4, 
CARD-5. CARD-6 or target molecules but which do not interfere with binding of 
the protein to its target molecule can be derivatized to the wells of the plate, and 
unbound target or protein trapped in the wells by antibody conjugation. Methods 
for detecting such complexes, in addition to those described above for the GST- 

3 0 immobilized complexes and epitope tag immobilized complexes, include 
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immunodetection of complexes using antibodies reactive with the CARD-3, 
CARD-4, CARD-5. or CARD-6 or target molecule, as well as enzyme-linked 
assays which rely on detecting an enzymatic activity associated with the CARD- 
3. CARD-4. CARD-5. CARD-6 or target molecule. 
5 In another embodiment, modulators of CARD-3. CARD-4. CARD-5. 

or. CARD-6 expression are identified in a method in which a cell is contacted 
with a candidate compound and the expression of the CARD-3. CARD-4, 
CARD-5. or CARD-6 promoter. mRNA or protein in the cell is determined. The 
level of expression of CARD-3, CARD-4. CARD-5. or CARD-6 mRNA or 

10 protein in the presence of the candidate compound is compared to the level of 
expression of CARD-3. CARD-4, CARD-5. or CARD-6 mRNA or protein in the 
absence of the candidate compound. The candidate compound can then be 
identified as a modulator of CARD-3. CARD-4, CARD-5. or CARD-6 
expression based on this comparison. For example, when expression of CARD- 

15 3. CARD-4. CARD-5. or CARD-6 mRNA or protein is greater (statistically 
significantly greater) in the presence of the candidate compound than in its 
absence, the candidate compound is identified as a stimulator of CARD-3. 
CARD-4. CARD-5. or CARD-6 mRNA or protein expression. Alternatively, 
when expression of CARD-3. CARD-4. CARD-5. or CARD-6 mRNA or protein 

2 0 is less (statistically significantly less) in the presence of the candidate compound 
than in its absence, the candidate compound is identified as an inhibitor of 
CARD-3. CARD-4. CARD-5. or CARD-6 mRNA or protein expression. The 
level of CARD-3. CARD-4. CARD-5. or CARD-6 mRNA or protein expression 
in the cells can be determined by methods described herein for detecting CARD- 

2 5 3, CARD-4. CARD-5, or CARD-6 mRNA or protein. The activity of the CARD- 

3. CARD-4. CARD-5. or CARD-6 promoter can be assayed by linking the 
CARD-3. CARD-4. CARD-5, or CARD-6 promoter to a reporter gene such as 
luciferase. secreted alkaline phosphatase, or beta-galactosidase and introducing 
the resulting construct into an appropriate vector, transfecting a host cell line, and 

3 0 measuring the activity of the reporter gene in response to test compounds. For 
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example, two C ARD-4-specific mRNAs were detected in a Northern blotting 
experiment, one of 4.6 kilobases and the other of 6.5-7.0 kilobases (Example 1 1 ). 
In Example 1 1 , CARD-4-specific mRN A species were found to be widely 
distributed in the tissues and cell lines studied. 
5 In yet another aspect of the invention, the CARD-3, CARD-4. CARD- 

5, or CARD-6 proteins can be used as "bait proteins" in a two-hybrid assay or 
three hybrid assay (see, e.g., U.S. Patent.No; 5,283,317; Zervos et al. (1993) Cell 
72:223-232; Madura et al. (1993) J. Biol. Chem. 268:12046-12054; Bartel et al. 
(1993) Bio/Techniques 14:920-924; Iwabuchi et al. (1993) Oncogene 8:1693- 

10 1696; and PCT Publication No. WO 94/10300), to identify other proteins, which 
bind to or interact with CARD-3, CARD-4. CARD-5. or CARD-6 ("CARD-3, 
CARD-4. CARD-5. or CARD-6-binding proteins" or "CARD-3, CARD-4. 
CARD-5, or CARD-6-bp") and modulate CARD-3, CARD-4. CARD-5, or 
CARD-6 activity. Such CARD-3. CARD-4. CARD-5. or CARD-6-binding 

15 proteins are also likely to be involved in the propagation of signals by the CARD- 
3. CARD-4. CARD-5. or CARD-6 proteins as. for example, upstream or 
downstream elements of the CARD-3, CARD-4. CARD-5, or CARD-6 pathway. 
For example. Example 7 describes the construction of a two-hybrid screening bait 
construct including human CARD-4L amino acids 1-145 comprising the CARD 

2 0 domain and the use of this bait construct to screen human mammary gland and 
prostate gland two-hybrid libraries resulting in the identification of human 
CARD-3 as a CARD-4 interacting protein. In another example. Example 8 
describes the construction of a two-hybrid screening bait construct including 
human CARD-4 amino acids 406-953 comprising the LRR domain and the use of 

2 5. this bait construct to screen a human mammary gland two-hybrid libraries 

resulting in the identification of hNUDC as a CARD-4 interacting protein. 

The two-hybrid system is based on the modular nature of most 
transcription factors, which consist of separable DNA-binding and activation 
domains. Briefly, the assay utilizes two different DNA constructs. In one 

3 0 construct, the gene that codes for CARD-3, CARD-4. CARD-5. or CARD-6 is 
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fused to a gene encoding the DNA binding domain of a known transcription 
factor (e.g., GAL-4); In the other construct, a DNA sequence, from a library of 
DNA sequences, that encodes an unidentified protein ("prey" or "sample") is 
fused to a gene that codes for the activation domain of the known transcription 
5 factor. If the "bait" and the "prey" proteins are able to interact, in vivo, forming 
an CARD-3. CARD-4, CARD-5, or CARD-6-dependent complex, the DNA- 
binding and activation domains of the transcription factor are brought into close 
proximity. This proximity allows transcription of a reporter gene (e.g.. LacZ) 
which is operably linked to a transcriptional regulatory site responsive to the 
1 0 transcription factor. Expression of the reporter gene can be detected and cell 

colonies containing the functional transcription factor can be isolated and used to 
obtain the cloned gene which encodes the protein which interacts with CARD-3. 
CARD-4. CARD-5. or CARD-6. 

In an embodiment of the invention, the ability of a test compound to 
15 modulate the activity of CARD-3, CARD-4. CARD-5, or CARD-6. or a 

biologically active portion thereof can be determined by assaying the ability of 
the test compound to block the binding of CARD-3. CARD-4. CARD-5, or - 
CARD-6 to its target proteins in a two-hybrid system assay. Example 7 describes 
a two-hybrid system assay for the interaction between CARD-3 and CARD-4 and 
2 0 Example 8 describes a two-hybrid system assay for the interaction between 

CARD-4 and its target protein hNUDC. To screen for test compounds that block 
the interaction between CARD-3 and CARD-4 and their target proteins, which 
include but are not limited to CARD-3. CARD-4. and hNUDC. a yeast two- 
hybrid screening strain coexpressing the interacting bait and prey constructs, for 

2 5 example, a CARD-4 bait construct and a CARD-3 prey construct as described in 

Example 7, is contacted with the test compound and the activity of the two- 
hybrid system reporter gene, usually HIS3. lacZ. or URA3 is assayed. If the 
strain remains viable but exhibits a significant decrease in reporter gene activity, 
this would indicate that the lest compound has inhibited the interaction between 

3 3 the bait and prey proteins. This assay could be automated lor high throughput 
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drug screening purposes, in another embodiment of the invention,CARD-3, 
CARD-4, CARD-5, or CARD-6 and their target proteins could be configured in 
the reverse two-hybrid system (Vidal et aL (1996) Proc. Natl. Acad. Sci. USA 
93 : 1 032 1 -6 and Vidal et al. ( 1 996) Proc. Natl. Acad. Sci. USA 93 : 1 03 1 5-20) 
5 designed specifically for efficient drug screening. In the reverse two-hybrid 
system, inhibition of a CARD-3 or CARD-4 physical interaction with a target 
protein would result in induction of a reporter gene in contrast to the normal two- 
hybrid system where inhibition of CARD-3. CARD-4. CARD-5. or CARD-6 
physical interaction with a target protein would lead to reporter gene repression. 

10 The reverse two-hybrid system is preferred for drug screening because reporter 
gene induction is more easily assayed than report gene repression. 

Alternative embodiments of the invention are proteins found to 
physically interact with proteins thai bind to CARD-3. CARD-4. CARD-5. or 
CARD-6. CARD-3, CARD-4. CARD-5. or CARD-6 interactors. including but 

15 not limited to hNUDC and CARD-3. could be configured into two-hybrid system 
baits and used in two-hybrid screens to identify additional members of the 
CARD-3. CARD-4, CARD-5. or CARD-6 pathway. The interactors of CARD-3. 
CARD-4. CARD-5. or CARD-6 interactors identified in this way could be useful 
targets for therapeutic intervention in CARD-3. CARD-4. CARD-5. or CARD-6 

2 0 related diseases and pathologies and an assay of their enzymatic or binding 

activity could be useful for the identification of test compounds that modulate 
CARD-3. CARD-4. CARD-5. or CARD-6 activity. 

This invention further pertains to novel agents identified by the above- 
described screening assays and uses thereof for treatments as described herein. 

25 

B. Detection Assays 

Portions or fragments of the cDNA sequences identified herein (and 
the corresponding complete gene sequences) can be used in numerous ways as 
polynucleotide reagents. For example, these sequences can be used to: (i) map 

3 0 their respective genes on a chromosome; and. thus, locate gene regions associated 
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with genetic disease; (ii) identify an individual from a minute biological sample 
(tissue typing); and (iii) aid in forensic identification of a biological sample. 
These applications are described in the subsections below. 
1 . Chromosome Mapping 
5 Once the sequence (or a' portion of the sequence) of a gene has been 

isolated, this sequence can be used to map the location of the gene on a 
chromosome. Accordingly, CARD-3. CARD-4, CARD-5, or CARD-6 nucleic 
acid molecules described herein or fragments thereof, can be used to map the 
location of CARD-3. CARD-4, CARD-5, or CARD-6 genes on a chromosome. 

10 Tne mapping of the CARD-3, CARD-4, CARD-5, or CARD-6 sequences to 

chromosomes is an important first step in correlating these sequences with genes 
associated with disease. 

Briefly, CARD-3. CARD-4, CARD-5. or CARD-6 genes can be 
mapped to chromosomes by preparing PCR primers (preferably 15-25 bp in 

15 length) from the CARD-3, CARD-4, CARD-5. or CARD-6 sequences. 

Computer analysis of CARD-3, CARD-4. CARD-5, or CARD-6 sequences can 
be used to rapidly select primers that do not span more than one exon in the . 
genomic DNA. thus complicating the amplification process. These primers can 
then be used for PCR screening of somatic cell hybrids containing individual 

2 0 human chromosomes. Only those hybrids containing the human gene 

corresponding to the CARD-3, CARD-4, CARD-5. or CARD-6 sequences will 
yield an amplified fragment. For example, in Example 6. human CARD-4- 
specific PCR primers were used to screen DNAs from a somatic cell hybrid panel 
showing that human CARD-4 maps to chromosome 7 close to the SHGC-31928 

2 5 genetic marker. 

Somatic cell hybrids arc prepared by fusing somatic cells from 
different mammals (e.g.. human and mouse cells). As hybrids of human and 
mouse cells grow and divide, they gradually lose human chromosomes in random 
order, but retain the mouse chromosomes. By using media in which mouse cells 

3 0 cannot grow, because they lack a particular enzyme, but human cells can. the one 
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human chromosome that contains the gene encoding the needed enzyme, will be 
retained. By using various media, panels of hybrid cell lines can be established. 
Each cell line in a panel contains either a single human chromosome or a small 
number of human chromosomes, and a full set of mouse chromosomes, allowing 
5 easy mapping of individual genes to specific human chromosomes. (D'Eustachio 
et al. ( 1 983) Science 220:91 9-924). Somatic cell hybrids containing only 
fragments of human chromosomes can also be produced by using human 
chromosomes with translocations and deletions. 

PCR mapping of somatic cell hybrids is a rapid procedure for 

1 0 assigning a particular sequence to a particular chromosome. Three or more 
sequences can be assigned per day using a single thermal cycler. Using the 
CARD-3, CARD-4. CARD-5. or CARD-6 sequences to design oligonucleotide 
primers, sublocalization can be achieved with panels of fragments from specific 
chromosomes. Other mapping strategies which can similarly be used to map a 

15 CARD-3. CARD-4. CARD-5, or CARD-6 sequence to its chromosome include in 
situ hybridization (described in Fan ct al. (1990) Proc. Natl. Acad. Sci. USA 
87:6223-27). pre-screening with labeled flow-sorted chromosomes, and pre- 
selection by hybridization to chromosome specific cDNA libraries. 

Fluorescence in situ hybridization (FISH) of a DNA sequence to a 

2 0 mctaphase chromosomal spread can further be used to provide a precise 

chromosomal location in one step. Chromosome spreads can be made using cells 
whose division has been blocked in metaphase by a chemical like colcemid thai 
disrupts the mitotic spindle. The chromosomes can be treated briefly with 
trypsin, and then stained with Giemsa. A pattern of light and dark bands 

2 5 develops on each chromosome, so that the chromosomes can be identified 

individually. The FISH technique can be used with a DNA sequence as short as 
500 or 600 bases. However, clones larger than 1.000 bases have a higher 
likelihood of binding to a unique chromosomal location with sufficient signal 
intensity for simple detection. Preferably ] .000 bases, and more preferably 2.000 

30 bases will suffice to get good results at a reasonable amount of time. For a 
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review of this technique, see Verma et al.« (Human Chromosomes: A Manual of 
Basic Techniques (Pergamon Press, New York. 1988)). 

Reagents for chromosome mapping can be used individually to mark a 
single chromosome or a single site on that chromosome, or panels of reagents can 
5 be used for marking multiple sites and/or multiple chromosomes. Reagents 
corresponding to noncoding regions of the genes actually are preferred for 
mapping purposes. Coding sequences are more likely to be conserved within 
gene families, thus increasing the chance of cross hybridizations during 
chromosomal mapping. 

1C Once a sequence has been mapped to a precise chromosomal location, 

the physical position of the sequence on the chromosome can be correlated w ith 
genetic map data. (Such data are found, for example, in V. McKusick, 
Mendelian Inheritance in Man. available on-line through Johns Hopkins 
University Welch Medical Library). The relationship between genes and disease. 

15 mapped to the same chromosomal region, can then be identified through linkage 
analysis (co-inheritance of physically adjacent genes), described in. e.g.. Egeland 
et al. (1987) Nature. 325:783-787. 

Moreover, differences in the DNA sequences between individuals 
affected and unaffected with a disease associated with the CARD-3. CARD-4. 

2 0 CARD-5, or CARD-6 gene can be determined. If a mutation is observed in some 
or all of the affected individuals but not in any unaffected individuals, then the 
mutation is likely to be the causative agent of the particular disease. Comparison 
of affected and unaffected individuals generally involves first looking for 
structural alterations in the chromosomes such as deletions or translocations that 

2 5 are visible from chromosome spreads or detectable using PCR based on that 
DNA sequence. Ultimately, complete sequencing of genes from several 
individuals can be performed to confirm the presence oi'a mutation and to 
distinguish mutations from polymorphisms. 
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2. Tissue Typing 

The CARD-3. CARD-4. CARD-5, or CARD-6 sequences of the 
present invention can also be used to identify individuals from minute biological 
samples. The United States military, for example, is considering the use of 
5 restriction fragment length polymorphism (RFLP) for identification of its 

personnel. In this technique, an individual's genomic DNA is digested with one 
or more restriction enzymes, and probed on a Southern blotto yield unique bands 
for identification. This method does not suffer from the current limitations of 
"Dog Tags" which can be lost, switched, or stolen, making positive identification 
10 difficult! The sequences of the present invention are useful as additional DNA 
markers for RFLP (described in U.S. Patent 5.272,057). 

Furthermore, the sequences of the present invention can be used to 
provide an alternative technique which determines the actual base-by-base DNA 
sequence of selected portions of an individual's genome. Thus, the CARD-3. 
15 CARD-4. CARD-5, or CARD-6 sequences described herein can be used to 

prepare two PCR primers from the 5' and 3' ends of the sequences. These primers 
can then be used to amplify an individual's DNA and subsequently sequence it. 

Panels of corresponding DNA sequences from individuals, prepared in 
this manner, can provide unique individual identifications, as each individual will 
2 0 have a unique set of such DNA sequences due to allelic differences. The 
sequences of the present invention can be used to obtain such identification 
sequences from individuals and from tissue. The CARD-3. CARD-4. CARD-5. 
or CARD-6 sequences of the invention uniquely represent portions of the human 
genome. Allelic variation occurs to some degree in the coding regions of these 

2 5 sequences, and to a greater degree in the noncoding regions. It is estimated that 

allelic variation between individual humans occurs with a frequency of about 
once per each 500 bases. Each of the sequences described herein can. to some 
degree, be used as a standard against which DNA from an individual can be 
compared for identification purposes. Because greater numbers of 

3 0 polymorphisms occur in the noncoding regions, fewer sequences are necessary to 
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differentiate individuals. The noncoding sequences of SEQ ID NO:l, SEQ ID 
NO:7. SEQ ID NO:25. SEQ ID NO:38. SEQ ID NO:40. SEQ ID NO:42. SEQ ID 
NO:48, SEQ ID NO:5L SEQ ID NO:54. and SEQ ID NO:60 can comfortably 
provide positive individual identification with a panel of perhaps 10 to 1.000 
5 primers which each yield a noncoding amplified sequence of 100 bases. If 

predicted coding sequences, such as those in SEQ ID NO:3. SEQ ID NO:9, SEQ 
II) NO;27. SEQ ID NO:50, SEQ ID NO:53. SEQ ID NO:56, and SEQ ID NO:62 
are used, a more appropriate number of primers for positive individual 
identification would be 500-2.000. 

1 . If a panel of reagents from CARD-3. CARD-4. CARD-5. or CARD-6 

sequences described herein is used to generate a unique identification database 
lor an individual, those same reagents can later be used to identify tissue from 
that individual. Using the unique identification database, positive identification 
of the individual, living or dead, can be made from extremely small tissue 
15 samples. 

3. Use of Partial Sequences in Forensic Biology 
DN A-based identification techniques can also be used in forensic 
. biology. Forensic biology is a scientific field employing genetic typing of 
biological evidence found at a crime scene as a means for positively identifying. 

2 0 for example, a perpetrator of a crime. To make such an identification. PCR 

technology can be used to amplify DNA sequences taken from very small 
biological samples such as tissues, e.g.. hair or skin, or body fluids, e.g.. blood, 
saliva, or semen found at a crime scene. The amplified sequence can then be 
compared to a standard, thereby allowing identification of the origin of the 

2 5 biological sample. 

The sequences of the present invention can be used to provide 
polynucleotide reagents, e.g.. PCR primers, targeted to specific loci in the human 
genome, which can enhance the reliability of DN A-based forensic identifications 
by. for example, providing another "identification marker" (i.e. another DNA 

3 0 sequence that is unique to a particular individual). As mentioned above, actual 

- 103 - 



BNSOOCID <WC 0100826A2 I * 



WO 01/00826 



PCT/US00/17691 



base sequence information can be used for identification as an accurate 
alternative to patterns formed by restriction enzyme generated fragments. 
Sequences targeted to noncoding regions of SEQ ID NO:l, SEQ ID NO:7. SEQ 
ID NO:25, SEQ ID NO:38, SEQ ID NO:40, SEQ ID NO:42. SEQ ID NO:48. 
5 SEQ ID NO:5 1 . SEQ ID NO: 54. and SEQ ID NO:60 are particularly appropriate 
for this use as greater numbers of polymorphisms occur in the noncoding regions, 
making it easier to differentiate individuals using this technique. Examples of 
polynucleotide reagents include the CARD-3, CARD-4, CARD-5. or CARD-6 
sequences or portions thereof, e.g.. fragments derived from the noncoding regions 

10 of SEQ ID NO: 1. SEQ ID NO:7. SEQ ID NO:25, SEQ ID NO:38. SEQ ID 
NO:40. SEQ ID NO:42. SEQ ID NO:48. SEQ ID NO:5 1 . SEQ ID NO:54. and 
SEQ ID NO:60 which have a length of at least 20 or 30 bases. 

The sequences described herein can further be used to provide 
polynucleotide reagents, e.g.. labeled or labelable probes which can be used in. 

15 for example, an in situ hybridization technique, to identify a specific tissue, e.g.. 
brain tissue. This can be very useful in cases where a forensic pathologist is 
presented with a tissue of unknown origin. Panels of such CARD-3. CARD-4. 
CARD-5, or CARD-6 probes can be used to identify tissue by species and/or by 
organ type. 

2 0 In a similar fashion, these reagents, e.g.. CARD-3, CARD-4. CARD- 

5. or CARD-6 primers or probes can be used to screen tissue culture lor 
contamination (i.e., screen for the presence of a mixture of different types of cells 
in a culture). 

2 5 C. Predictive Medicine 

The present invention also pertains to the field of predictive medicine 
in which diagnostic assays, prognostic assays, pharmacogenomics. and 
monitoring clinical trials are used for prognostic (predictive) purposes to thereb> 
treat an individual prophylactically. Accordingly one aspect of the present 

3 0 invention relates to diagnostic assays for determining CARD-3. CARD-4. 
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CARD-5, or CARD-6 protein and/or nucleic acid expression as well as CARD-3. 
CARD-4. CARD-5, or CARD-6 activity, in the context of a biological sample 
(e.g.. blood, serum, cells, tissue) to thereby determine whether an individual is 
afilicted with a disease or disorder, or is at risk of developing a disorder. 
5 associated with aberrant CARD-3. CARD-4. CARD-5, or CARD-6 expression or 
activity. The invention also provides for prognostic (or predictive) assays for 
determining whether an individual is at risk of developing a disorder associated 
with CARD-3, CARD-4, CARD-5. or CARD-6 protein, nucleic acid expression 
or activity. For example, mutations in a CARD-3. CARD-4. CARD-5, or CARD- 

10 6 gene can be assayed in a biological sample. Such assays can be used for 

prognostic or predictive purpose to thereby prophylactically treat an individual 
prior to the onset of a disorder characterized by or associated with CARD-3, 
CARD-4. CARD-5. or CARD-6 protein, nucleic acid expression or activity. 

Another aspect of the invention provides methods for determining 

15 CARD-3. CARD-4, CARD-5, or CARD-6 protein, nucleic acid expression or 
CARD-3. CARD-4. CARD-5. or CARD-6 activity in an individual to thereby 
select appropriate therapeutic or prophylactic agents for thai individual (referred 
to herein as "pharmacogenomics"). Pharmacogenomics allows for the selection 
of agents (e.g.. drugs) for therapeutic or prophylactic treatment of an individual 

2 0 based on the genotype of the individual (e.g.. the genotype of the individual 

examined to determine the ability of the individual to respond to a particular 
agent.) 

Yet another aspect of the invention pertains to monitoring the 
influence of agents (e.g.. drugs or other compounds) on the expression or activity 
25 of CARD-3. CARD-4. CARD-5. or CARD-6 in clinical trials. 

These and other agents are described in further detail in the following 

sections. 

1 . Diagnostic Assays 

.An exemplary method for detecting the presence or absence of 

3 0 CARD-3. CARD-4. CARD-5. or CARD-6 in a biological sample involves 
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obtaining a biological sample from a test subject and contacting the biological 
sample with a compound or an agent capable of detecting CARD-3, CARD-4. 
CARD-5. or CARD-6 protein or nucleic acid (e.g.. mRNA. genomic DNA) that 
encodes CARD-3. CARD-4, CARD-5. or CARD-6 protein such that the presence 
5 of CARD-3. CARD-4, CARD-5, or CARD-6 is detected in the biological sample. 
An agent for detecting CARD-3, CARD-4. CARD-5. or CARD-6 mRNA or 
genomic DNA is a labeled nucleic acid probe capable of hybridizing to CARD-3, 
CARD-4. CARD-5. or CARD-6 mRNA or genomic DNA. The nucleic acid 
probe can be. for example, a full-length CARD-3, CARD-4. CARD-5. or CARD- 

I C 0 nucleic acid, such as the nucleic acid of SEQ ID NO: 1 or 3. SEQ ID NO:7 or 9. 
SEQ ID NO:25 or 27, SEQ ID NO:38. SEQ ID NO:40, SEQ ID NO:42, SEQ ID 
\ r O:4S or 50. SEQ ID NO:5 1 or SEQ ID NO:53 or SEQ ID NO:54 or SEQ ID 
NO:56. SEQ ID NO:60 or 62. or a portion thereof, such as an oligonucleotide of 
at least 15. 30. 50, 100, 250 or 500 nucleotides in length and sufficient to 

15 specifically hybridize under stringent conditions to mRNA or genomic DNA, or a 
human CARD-4 splice variant such as the nucleic acid of SEQ ID NO:38 or SEQ 
ID NO:40. Other suitable probes for use in the diagnostic assays of the invention 
are described herein. For example. Example 1 1 describes the use of a nucleic- 
acid probe to detect CARD-4 mRNAs in human tissues and cell lines and the 

2 0 probe used in this experiment could be used for a diagnostic assay. 

An agent for detecting CARD-3. CARD-4. CARD-5. or CARD-6 
protein can be an antibody capable of binding to CARD-3, CARD-4. CARD-5. or 
CARD-6 protein, preferably an antibody with a detectable label. Antibodies can 
be polyclonal, or more preferably, monoclonal. For example, polypeptides 

2 5 corresponding to amino acids 128-139 and 287-298 of human CARD-4L were 

used to immunize rabbits and produce polyclonal antibodies that specifically 
recognize human CARD-4L. An intact antibody, or a fragment thereof (e.g.. Fab 
or F(ab')2) can be used. The term "labeled", with regard to the probe or antibody, 
is intended to encompass direct labeling of the probe or antibody by coupling 

3 0 (i.e.. physically linking) a detectable substance to the probe or antibody, as well 
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as indirect labeling of the probe or antibody by reactivity with another reagent 
that is directly labeled. Examples of indirect labeling include detection of a 
primary antibody using a fluorescently labeled secondary antibody and end- 
labeling of a DNA probe with biotin such that ii can be detected with 
5 fluorescently labeled streptavidin. The term "biological sample' 1 is intended to 
include tissues, cells and biological fluids isolated from a subject, as well as 
tissues, cells and fluids present within a subject. That is, the detection method of 
the invention can be used to detect CARD-3. CARD-4. CARD-5. or CARD-6 
mRNA. protein, or genomic DNA in a biological sample in vitro as well as in 

1 0 vivo. For example, in vitro techniques for detection of CARD-3. CARD-4. 
CARD-5. or CARD-6 mRNA include Northern hybridizations and in situ 
hybridizations. For example. Example 1 1 contains the use of a human CARD-4L 
nucleic acid probe for a Northern blotting analysis of mRNA species encoded by 
human CARD-4L detected in RNA samples from human tissues and cell lines. 

15 In vitro techniques for detection of CARD-3 or CARD-4 protein include enzyme 
linked immunosorbent assays (ELlSAs). Western blots, immunoprecipitations 
and immunofluorescence. In vitro techniques for detection of CARD-3. CARD- 
4. CARD-5. or CARD-6 genomic DNA include Southern hybridizations. 
Furthermore, in vivo techniques for detection of CARD-3. CARD-4. CARD-5. or 

2 0 CARD-6 protein include introducing into a subject a labeled anti-CARD-3. 
CARD-4. CARD-5. or CARD-6 antibody. For example, the antibody can be 
labeled with a radioactive marker whose presence and location in a subject can be 
detected by standard imaging techniques. 

In one embodiment, the biological sample contains protein molecules 

2 5 from ;the test subject. Alternatively, the biological sample can contain mRNA 

molecules from the test subject or genomic DNA molecules from the test subject. 
A biological sample is a peripheral blood leukocyte sample isolated by 
conventional means from a subject. 

In another embodiment, the methods further involve obtaining a 

3 0 control biological sample from a control subject, contacting the control sample 
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with a compound or agent capable of detecting CARD-3, CARD-4, CARD-5, or 
CARD-6 protein, mRNA, or genomic DNA, such that the presence of CARD-3, 
CARD-4, CARD-5, or CARD-6 protein, mRNA or genomic DNA is detected in 
the biological sample, and comparing the presence of CARD-3. CARD-4. 
5 CARD-5. or CARD-6 protein, mRNA or genomic DNA in the control sample 
with the presence of CARD-3, CARDr4. CARD-5. or CARD-6 protein. mRNA 
or genomic DNA in the test sample. 

The invention also encompasses kits for detecting the presence of 
CARD-3. CARD-4. CARD-5, or CARD-6 in a biological sample (a test sample). 

1 0 Such kits can be used to determine if a subject is suffering from or is at increased 
risk of developing a disorder associated with aberrant expression of CARD-3. 
CARD-4. CARD-5, or CARD-6 (e.g.. an immunological disorder). For example, 
the kit can comprise a labeled compound or agent capable of detecting CARD-3. 
CARD-4. CARD-5. or CARD-6 protein or mRNA in a biological sample and 

1 5 means for determining the amount of CARD-3. CARD-4. CARD-5. or CARD-6 
in the sample (e.g.. an anti-CARD-3. CARD-4. CARD-5, or CARD-6 antibody or 
an oligonucleotide probe which binds to DNA encoding CARD-3, CARD-4. 
CARD-5, or CARD-6. e.g.. SEQ ID NO:l. SEQ ID NO:3. SEQ ID NO:7. SEQ 
ID NO:9. SEQ ID NO:25. SEQ ID NO:27. SEQ ID NO:38. SEQ ID NO:40. SEQ 

2 0 ID NO:42, SEQ IS NO:48. SEQ ID NO:50. SEQ ID NO:5 1 . SEQ ID NO:53. 
SEQ ID NO:54. SEQ ID NO:56, SEQ ID NO:60. or SEQ ID NO:62). Kits may 
also include instruction for observing that the tested subject is suffering from or is 
at risk of developing a disorder associated with aberrant expression of CARD-3. 
CARD-4. CARD-5. or CARD-6 if the amount of CARD-3, CARD-4. CARD-5. 

2 5 or CARD-6 protein or mRNA is above or below a normal level. 

For antibody-based kits, the kit may comprise, for example: (1 ) a first 
antibody (e.g.. attached to a solid support) which binds to CARD-3, CARD-4. 
CARD-5. or CARD-6 protein: and. optionally. (2) a second, different antibody 
which binds to CARD-3. CARD-4. CARD-5. or CARD-6 protein or ihe first 

3 C antibody and is conjugated to a detectable agent. 
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For oligonucleotide-based kits, the kit may comprise, for example: ( 1 ) 
a oligonucleotide, e.g.. a detectably labelled oligonucleotide, which hybridizes to 
a CARD-3, CARD-4. CARD-5, or CARD-6 nucleic acid sequence or (2) a pair of 
primers useful for amplifying a CARD-3. CARD-4, CARD-5. or CARD-6 
5 nucleic acid molecule. 

The kit may also comprise, e.g.. a buffering agent, a preservative, or a 
protein stabilizing agent. The kit may also comprise components necessary for 
detecting the detectable agent (e.g., an enzyme or a substrate). The kit may also 
contain a control sample or a series of control samples which can be- assayed and 

1 C compared to the test sample contained. Each component of the kit is usually 

enclosed within an individual container and all of the various containers are 
within a single package along with instructions for observing whether the tested 
subject is suffering from or is at risk of developing a disorder associated with 
aberrant expression of CARD-3, CARD-4, CARD-5. or CARD-6. 
15 2. Prognostic Assays 

The methods described herein can furthermore be utilized as 
diagnostic or prognostic assays to identity' subjects having or at risk of 
developing a disease or disorder associated with aberrant CA.RD-3. CARD-4. 
CARD-5. or CARD-6 expression or activity. For example, the assays described 

2 0 herein, such as the preceding diagnostic assays or the following assays, can be 

utilized to identify a subject having or at risk of developing a disorder associated 
with CARD-3. CARD-4. CARD-5. or CARD-6 protein, nucleic acid expression 
or activity. Alternatively, the prognostic assays can be utilized to identify a 
subject having or at risk for developing such a disease or disorder. Thus, the 
2 5 present invention. provides a method in which a test sample is obtained from a 
subject and CARD-3. CARD-4. CARD-5. or CARD-6 protein or nucleic acid 
(e.g.. mRNA. genomic DNA) is delected, wherein the presence of CARD-3. 
CARD-4; CARD-5. or CARD-6 protein or nucleic acid is diagnostic for a subject 
having or at risk of developing a disease or disorder associated with aberrant . 
CARD-3. CARD-4. CARD-5. or CARD-6 expression or activity As used 
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herein, a "test sample'* refers to a biological sample obtained from a subject of 
interest. For example, a test sample can be a biological fluid (e.g.. serum), cell 
sample, or tissue. Furthermore, the prognostic assays described herein can be 
used to determine whether a subject can be administered an agent (e.g.. an 
5 agonist, antagonist, peptidomimetic, protein, peptide, nucleic acid, small 

molecule, or other drug candidate) to treat a disease or disorder associated with 
aberrant CARD-3. CARD-4, CARD-5, or CARD-6 expression or activity. For 
example, such methods can be used to determine whether a subject can be 
effectively treated with a specific agent or class of agents (e.g.. agents of a type 

10 which decrease CARD-3. CARD-4. CARD-5. or CARD-6 activity). Thus, the 
present invention provides methods for determining whether a subject can be 
effectively treated with an agent for a disorder associated with aberrant CARD-3. 
CARD-4. CARD-5. or CARD-6 expression or activity in which a test sample is 
obtained and CARD-3. CARD-4, CARD-5. or CARD-6 protein or nucleic acid is 

15 detected (e.g.. wherein the presence of CARD-3. CARD-4. CARD-5. or CARD-6 
protein or nucleic acid is diagnostic for a subject that can be administered the 
agent to treat a disorder associated with aberrant CARD-3, CARD-4. CARD-5. or 
CARD-6 expression or activity). 

The methods of the invention can also be used to detect genetic 

2 0 lesions or mutations in a CARD-3, CARD-4. CARD-5. or CARD-6 gene, thereby 
determining if a subject with the lesioned gene is at risk for a disorder 
characterized by aberrant cell proliferation and/or differentiation. In preferred 
embodiments, the methods include detecting, in a sample of cells from the 
subject, the presence or absence of a genetic lesion characterized by at least one 

2 5 of an alteration affecting the integrity of a gene encoding a CARD-3. CARD-4. 
CARD-5. or CARD-6-protein. or the mis-expression of the CARD-3. CARD-4. 
CARD-5, or CARD-6 gene. For example, such genetic lesions can be detected 
by ascertaining the existence of at least one of 1 ) a deletion of one or more 
nucleotides from a CARD-3. CARD-4. CARD-5. or CARD-6 gene: 2) an 

3G addition of one or more nucleotides to a CARD-3. CARD-4. CARD-5. or CARD- 
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6 gene; 3) a substitution of one or more nucleotides of a CARD-3, CARD-4, 
CARD-5. or CARD-6 gene; 4) a chromosomal rearrangement of a CARD- 3. 
CARD-4, CARD-5. or CARD-6 gene; 5) an alteration in the level of a messenger 
RNA transcript of a CARD-3, CARD-4, CARD-5. or CARD-6 gene; 6) aberrant 
5 modification of a CARD-3, CARD-4. CARD-5. or CARD-6 gene, such as of the 
methylation pattern of the genomic DN A; 7) the presence of a non-wild type 
splicing pattern of a messenger RNA transcript of a CARD-3. CARD-4, CARD- 
5. or CARD-6 gene (e.g. caused by a mutation in a splice donor or splice acceptor 
site); 8) a non-wild type level of a CARD-3. CARD-4. CARD-5. or CARD-6- 

10 protein; 9) allelic loss of a CARD-3. CARD-4. CARD-5, or CARD-6 gene; and 
1 0) inappropriate post-translational modification of a CARD-3. CARD-4. _ 
CARD-5. or CARD-6-protein. As described herein, there are a large number of 
assay techniques kxiovvn in the art which can be used for detecting lesions in a 
CARD-3. CARD-4, CARD-5. or CARD-6 gene. A biological sample is a 

15 peripheral blood leukocyte sample isolated by conventional means from a 
subject. 

In certain embodiments, detection of the lesion involves the use of a 
probe/primer in a polymerase chain reaction (PCR) (see. e.g., U.S. Patent Nos. 
4,683,195 and 4.683.202). such as anchor PCR or RACE PCR, or. alternatively. 
2 0 in a ligation chain reaction (LCR) (see. e.g.. Landegran et al. (1988) Science 

241:1077-1080; and Nakazawa et al. (1994) Proc. Natl. Acad. Sci. USA 91:360- 
364). the latter of which can be particularly useful for detecting point mutations 
in the CARD-3 or CARD-4-gene (see. e.g.. Abravaya et al. (1995) Nucleic Acids 
Res. 23:675-682). This method can include the steps of collecting a sample of 

2 5 cells from a patient, isolating nucleic acid (e.g.. genomic. mRNA or both) from 

the cells of the sample, contacting the nucleic acid sample with one or more 
primers which specifically hybridize to a CARD-3. CARD-4. CARD-5. or 
CARD-6 gene under conditions such that hybridization and amplification of the 
CARD-3. CARD-4. CARD-5. or CARD-6-gene (if present) occurs, and detecting 

3 0 the presence or absence of an amplification product, or detecting the size of the 
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amplification product and comparing the length to a control sample. It is 
anticipated that PCR and/or LCR may be desirable to use as a preliminary 
amplification step in conjunction with any of the techniques used for detecting 
mutations described herein. 
5 Alternative amplification methods include: self sustained sequence 

replication (Guatelli et al. (1990) Proc. Natl. Acad. Sci. USA 87:1874-1878), 
transcriptional amplification system (Kwoh. et al. (1989) Proc. Natl. Acad. Sci. 
USA 86:1 173-1 177), Q-Beta Replicase (Lizardi et al. (1988) Bio/Technology 
6: 1 1 97). or any other nucleic acid amplification method, followed by the 

1 C detection of the amplified molecules using techniques well known to those of 
skill in the art. These detection schemes are especially useful for the detection of 
nucleic acid molecules if such molecules are present in very low numbers. 

In an alternative embodiment, mutations in a CARD-3. CARD-4. 
CARD-5. or CARD-6 gene from a sample cell can be identified by alterations in 

1 5 restriction enzyme cleavage patterns. For example, sample and control DN A is 
isolated, amplified (optionally), digested with one or more restriction 
endonucieases. and fragment length sizes are determined by gel electrophoresis 
and compared. Differences in fragment length sizes between sample and control 
DNA indicates mutations in the sample DNA. Moreover, the use of sequence 

2 0 specific ribozymes (see. e.g.. U.S. Patent No. 5.498,531 ) can be used to score for 
the presence of specific mutations by development or loss of a ribozyme cleavage 
site. 

In other embodiments, genetic mutations in CARD-3. CARD-4. 
CARD-5. or CARD-6 can be identified by hybridizing a sample and control 

2 5 nucleic acids, e.g., DNA or RNA. to high density arrays containing hundreds or 

thousands of oligonucleotides probes (Cronin et al. ( 1996) Human Mutation 
7:244-255: Kozal et al. ( 1 996) Nature Medicine 2:753-759). For example, 
genetic mutations in CARD-3 or CARD-4 can be identified in two-dimensional 
arrays containing light-generated DNA probes as described in Cronin et al. supra. 

3 0 Briefly, a first hybridization array of probes can be used to scan through long 
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stretches of DN A in a sample and control to identify base changes between the 
sequences by making linear arrays of sequential overlapping probes. This step 
allows the identification of point mutations. This step is followed by a second 
hybridization array that allows the characterization of specific mutations by using 
5 smaller, specialized probe arrays complementary to all variants or mutations 
detected. Each mutation array is composed of parallel probe sets, one 
complementary to the wild-type gene and the other complementary to the mutant 
gene. 

In yet another embodiment, any of a variety of sequencing reactions 

10 known in the an can be used to directly sequence the CARD-3. CARD-4. CARD- 
5. or CARD-6 gene and detect mutations by comparing the sequence of the 
sample CARD-3. CARD-4. CARD-5. or CARD-6 with the corresponding wild- 
type (control) sequence. Examples of sequencing reactions include those based 
on techniques developed by Maxam and Gilbert (( 1 977) Proc. Natl. Acad. Sci. 

15 USA 74:560) or Sanger ((1977) Proc. Natl. Acad. Sci. USA 74:5463). It is also 
contemplated that any of a variety of automated sequencing procedures can be 
utilized when performing the diagnostic assays ((1995) Bio/Techniques 19:448). 
including sequencing by mass spectrometry (see. e.g.. PCT Publication No. WO 
94/16101; Cohen et al. (1996) Adv. Chromatogr. 36:127-162; and Griffin etal. 

20 (1993) Appl. Biochem. Biotechnol. 38:147-159). 

Other methods for detecting mutations in the CARD-3 or CARD-4 
gene include methods in which protection from cleavage agentsis used to detect 
mismatched bases in RNA/RNA or RNA/DNA heteroduplexes (IVlyers et al. 
( 1 985) Science 230: 1242). In general, the an technique of "mismatch cleavage" 

2 5 starts by providing heteroduplexes of formed by hybridizing ( labeled) RNA or 
DNA containing the wild-type CARD-3. CARD-4. CARD-5. or CARD-6 
sequence with potentially mutant RNA or DNA obtained from a tissue sample. 
The double-stranded duplexes are treated with an agent which cleaves single- 
stranded regions of the duplex such as which will exist due to basepair 

?■ 0 mismatches between the control and sample strands. For instance. RNA/DNA 
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duplexes can be treated with RNase and DNA/DNA hybrids treated with SI 
nuclease to enzymatically digesting the mismatched regions. In other 
embodiments, either DNA/DNA or RNA/DNA duplexes can be treated with 
hydroxylamine or osmium tetroxide and with piperidine in order to digest 
5 mismatched regions. After digestion of the mismatched regions, the resulting 
material is then separated by size on denaturing polyacrylamide gels to determine 
the site of mutation. See, e.g.. Cotton et al (1988) Proc. Natl Acad Sci USA 
85:4397: Saleeba et al (1992) Methods Enzymol. 217:286-295. In an 
embodiment, the control DNA or RNA can be labeled for detection. 

1 0 'n still another embodiment, the mismatch cleavage reaction employs 

one or more proteins that recognize mismatched base pairs in double-stranded 
DNA (so called "DNA mismatch repair" enzymes) in defined systems for 
detecting and mapping point mutations in CARD-3. CARD-4. CARD-5, or 
CARD-6 cDNAs obtained from samples of cells. For example, the mutY enzyme 

15 of E. coli cleaves A at G/A mismatches and the thymidine DNA glycosylase from 
HeLa cells cleaves T at G/T mismatches (Hsu et ah ( 1 994) Carcinogenesis 
15: 1657-1662). According to an exemplary embodiment, a probe based on a 
CARD-3, CARD-4. CARD-5, or CARD-6 sequence, e.g., a wild-type CARD-3. 
CARD-4. CARD-5. or CARD-6 sequence, is hybridized to a cDNA or other 

2 0 DNA product from a test cell(s). The duplex is treated with a DNA mismatch 
repair enzyme, and the cleavage products, if any. can be delected from 
electrophoresis protocols or the like. See. e.g., U.S. Patent No. 5.459.039. 

In other embodiments, alterations in electrophorctic mobility will be 
used to identify mutations in CARD-3. CARD-4. CARD-5. or CARD-6 genes. 

2 5 For example.-single strand conformation polymorphism (SSCP) may be used to 

detect differences in electrophoretic mobility between mutant and wild type 
nucleic acids (Orita et al. ( 1 989) Proc Natl. Acad. Sci USA: 86:2766. see also 
Cotton (1993) Mutat. Res. 285:125-144; and Hayashi (1992) Genet Anal Tech 
Appl 9:73-79). Single-stranded DNA fragments of sample and control CARD-3 

3 0 or CARD-4 nucleic acids will be denatured and allowed to renature. The 
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secondary structure of single-stranded nucleic acids varies according to sequence, 
the resulting alteration in electrophoretic mobility enables the detection of even a 
single base change. The DNA fragments may be labeled or detected with labeled 
probes. The sensitivity of the assay may be enhanced by using RNA (rather than 
5 DNA), in which the secondary structure is more sensitive to a change in 

sequence. In an embodiment, the subject method utilizes heteroduplex analysis 
to separate double stranded heteroduplex molecules on the basis of changes in 
electrophoretic mobility (Keen et al. (1991) Trends Genet 7:5). 

In yet another embodiment, the movement of mutant or wild-type 

1 0 fragments in polyacrylamide gels containing a gradient of denaturant is assayed 

using denaturing gradient gel electrophoresis (DGGE) (Myers et al. (1985) 
Nature 3 13:495). When DGGE is used as the method of analysis. DNA will be 
modified to insure that it does not completely denature, for example by adding a 
GC clamp of approximately 40 bp of high-melting GC-rich DNA by PCR. In a 
15 further embodiment, a temperature gradient is used in place of a denaturing 
gradient to identify differences in the mobility of control and sample DNA 
(Rosenbaum and Reissner ( 1987) Biophys Chem 265:12753). 

Examples of other techniques for detecting point mutations include, 
but are not limited to, selective oligonucleotide hybridization, selective 

2 0 amplification, or selective primer extension. For example, oligonucleotide 

primers may be prepared in which the known mutation is placed centrally and 
then hybridized to target DNA under conditions which permit hybridization only 
if a perfect match is found (Saiki et al. (1986) Nature 324:163); Saiki et al. (1989) 
Proc. Natl Acad. Sci USA 86:6230). Such allele specific oligonucleotides are 

2 5 hybridized to PCR amplified target DNA or a number of different mutations 

when the oligonucleotides are attached to the hybridizing membrane and 
hybridized with labeled target DNA. 

Alternatively, allele specific amplification technology which depends 
on selective PCR amplification may be used in conjunction with the instant 

3 0 invention. Oligonucleotides used as primers for specific amplification may earn 
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the mutation of interest in the center of the molecule (so that amplification 
depends on differential hybridization) (Gibbs et al. ( 1 989) Nucleic Acids Res. 
1 7:2437-2448) or at the extreme 3* end of one primer where, under appropriate 
conditions, mismatch can prevent, or reduce polymerase extension (Prossner 
5 ( 1 993) Tibtech 1 1 :238). In addition, it may be desirable to introduce a novel 
restriction site in the region of the mutation to create cleavage-based detection 
(Gasparini et al. (1992) Mol. Cell Probes 6: 1 ). It is anticipated that in certain 
embodiments amplification may also be performed using Taq ligase for 
amplification (Barany (1991 ) Proc. Natl. Acad. Sci USA 88:189). In such cases. 
10 ligation will occur only if there is a perfect match at the 3' end of the 5' sequence 
making it possible to detect the presence of a known mutation at a specific site bv 
looking for the presence or absence of amplification. 

The methods described herein may be performed, for example, by- 
utilizing pre-packaged diagnostic kits comprising at least one probe nucleic acid 
-15 or antibody reagent described herein, which may be conveniently used, e.g., in 
clinical settings to diagnose patients exhibiting symptoms or family history of a 
disease or illness involving a CARD-3. CARD-4, CARD-5. or CARD-6 gene. 

Furthermore, any cell type or tissue, preferably peripheral blood 
leukocytes, in which CARD-3 or CARD-4 is expressed may be utilized in the 
2 0 prognostic assays described herein. 

3 . Pharmacogenomics 

Agents, or modulators which have a stimulatory or inhibitory effect on 
CARD-3. CARD-4, CARD-5. or CARD-6 activity (e.g.. CARD-3. CARD-4. 
CARD-5. or CARD-6 gene expression) as identified by a screening assay 

2 5 described herein can be administered to individuals to treat (prophylactically or 

therapeutically) disorders (e.g., an immunological disorder) associated with 
aberrant CARD-3. CARD-4. CARD-5, or CARD-6 activity, in conjunction with 
such treatment, the pharmacogenomics (i.e., the study of the relationship between 
an individual's genotype and that individual's response to a foreign compound or 

3 G drug) of the individual may be considered. Differences in metabolism of 
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therapeutics can lead to severe toxicity or therapeutic failure by altering the 
relation between dose and blood concentration of the pharmacologically active 
drug. Thus, the pharrnacogenomics of the individual permits the selection of 
effective agents (e.g., drugs) for prophylactic or therapeutic treatments based on a 
5 consideration of the individual's genotype. Such pharrnacogenomics can further 
be used to determine appropriate dosages and therapeutic regimens. 
Accordingly, the activity of CARD-3. CARD-4, CARD-5. or CARD-6 protein, 
expression of CARD-3. CARD-4. CARD-5, or CARD-6 nucleic acid, or 
mutation content of CARD-3, CARD-4, CARD-5, or CARD-6 genes in an 

1 C individual can be determined to thereby select appropriate agent(s) for therapeutic 
or prophylactic treatment of the individual. 

Pharrnacogenomics deals with clinically significant hereditary 
variations in the response to drugs due to altered drug disposition and abnormal 
action in affected persons. See. e.g.. Linder (1997) Clin. Chem. 43(2):254-266. 

15 In general, two types of pharmacogenetic conditions can be differentiated. 

Genetic conditions transmitted as a single factor altering the way drugs act on the 
bodv (altered drug action) or genetic conditions transmitted as single factors 
altering the way the body acts on drugs (altered drug metabolism). These 
pharmacogenetic conditions can occur either as rare defects or as polymorphisms. 

2 0 For example, glucose-6-phosphate dehydrogenase deficiency (G6PD) is a 
common inherited enzymopathy in which the main clinical complication is 
haemolysis after ingestion of oxidant drugs (anti-malarials, sulfonamides, 
analgesics, nitrofurans) and consumption of fava beans. 

As an illustrative embodiment, the activity of drug metabolizing 

2 5 enzymes is a major determinant of both the intensity and duration of drug action. 

The discovery of genetic polymorphisms of drug metabolizing enzymes (e.g.. N- 
acetyltransferase 2 (NAT 2) and cytochrome P450 enzymes CYP2D6 and 
CYP2C19) has provided an explanation as to why some patients do not obtain the 
expected drug effects or show exaggerated drug response and serious toxicity 

3 0 after taking the standard and safe dose of a drug. These polymorphisms are 
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expressed in two phenotypes in the population, the extensive metabolizer (EM) 
and poor metabolizer (PM). The prevalence of PM is different among different 
populations. For example, the gene coding for CYP2D6 is highly polymorphic 
and several mutations have been identified in PM which all lead to the absence 
5 of functional CYP2D6. Poor metabolizers of CYP2D6 and CYP2C 19 quite 
frequently experience exaggerated drug response and side effects when they 
receive standard doses. If a metabolite is the active therapeutic moiety, PM show 
no therapeutic response, as demonstrated for the analgesic effect of codeine 
mediated by its CYP2D6-formed metabolite morphine. The other extreme are 

1 0 the so called ultra-rapid metabolizers who do not respond to standard doses. 
Recently, the molecular basis of ultra-rapid metabolism has been identified to be 
due to CYP2D6 gene amplification. 

Thus, the activity of CARD-3. CARD-4, CARD-5, or CARD-6 
protein, expression of CARD-3 or CARD-4 nucleic acid, or mutation content of 

1 5 CARD-3. CARD-4, CARD-5. or CARD-6 genes in an individual can be 

determined to thereby select appropriate agent(s) for therapeutic or prophylactic 
treatment of the individual. In addition, pharmacogenetic studies can be used to 
apply genotyping of polymorphic alleles encoding drug-metabolizing enzymes to 
the identification of an individual's drug responsiveness phenotvpe. This 

2 0 knowledge, when applied to dosing or drug selection, can avoid adverse reactions 
or therapeutic failure and thus enhance therapeutic or prophylactic efficiency 
when treating a subject with a CARD-3. CARD-4. CARD-5, or CARD-6 
modulator, such as a modulator identified by one of the exemplarv screening 
assays described herein. 

2 5 4. Monitoring of Effects During Clinical Trials' 

Monitoring the influence of agents (e.g.. drugs, compounds) on the 
expression or activity of CARD-3. CARD-4. CARD-5. or CARD-6 (e.g.. the 
ability to modulate aberrant cell proliferation and/or differentiation) can be 
applied not only in basic drug screening, but also in clinical trials. For example. 

3 0 the effectiveness of an agent determined by a screening assay as described herein 
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to increase CARD-3. CARD-4, CARD-5. or CARD-6 gene expression, protein 
levels, or upregulate CARD-3. CARD-4, CARD-5. or CARD-6 activity, can be 
monitored in clinical trails of subjects exhibiting decreased CARD-3, CARD-4, 
CARD-5. or CARD-6 gene expression, protein levels, or downregulated CARD- 
5 3, CARD-4. CARD-5, or CARD-6 activity. Alternatively, the effectiveness of an 
agent determined by a screening assay to decrease CARD-3. CARD-4. CARD-5. 
or CARD-6 gene expression, protein levels, or downregulated CARD-3. CARD- 
4. CARD-5. or CARD-6 activity, can be monitored in clinical trials of subjects 
exhibiting increased CARD-3. CARD-4. CARD-5. or CARD-6 gene expression. 

1 0 protein levels, or upregulated CARD-3. CARD-4. CARD-5. or CARD-6 activity. 
In such clinical trials, the expression or activity of CARD-3. CARD-4. CARD-5. 
or CARD-6 and. preferably, other genes that have been implicated in. for 
example, a cellular proliferation disorder can be used as a "read out" or markers 
of the immune responsiveness of a particular cell. 

15 For example, and not by way of limitation, genes, including CARD-3, 

CARD-4. CARD-5. or CARD-6. that are modulated in cells by treatment with an 
agent (e.g.. compound, drug or small molecule) which modulates CARD-3. 
CARD-4. CARD-5. or CARD-6 activity (e.g.. identified in a screening assay as 
described herein) can be identified. Thus, to study the effect of agents on cellular 

2 0 proliferation disorders, for example, in a clinical trial, cells can be isolated and 
RNA prepared and analyzed for the levels of expression of CARD-3. CARD-4. 
CARD-5, or CARD-6 and other genes implicated in the disorder. The levels of 
gene expression (i.e.. a gene expression pattern) can be quantified by Northern 
blot analysis or RT-PCR. as described herein, or alternatively by measuring the 

2 5 amount of protein produced, by one of the methods as described herein, or by- 

measuring the levels of activity of CARD-3. CARD-4. CARD-5. or CARD-6 or 
other genes. In this way. the gene expression pattern can serve as a marker, 
indicative of the physiological response of the cells to the agent. Accordingly, 
this response state may be determined before, and at various points during. 

3 0 treatment of the individual with the agent. 
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In an embodiment, the present invention provides a method for 
monitoring the effectiveness of treatment of a subject with an agent (e.g., an 
agonist, antagonist, peptidomimetic, protein, peptide, nucleic acid, small 
molecule, or other drug candidate identified by the screening assays described 
5 herein) comprising the steps of (i) obtaining a pre-administration sample from a 
subject prior to administration of the agent; (ii) detecting the level of expression 
of a CARD-3, CARD-4, CARD-5, or CARD-6 protein, mRNA, or genomic DNA 
in the preadministration sample; (iii) obtaining one or more post-administration 
samples from the subject; (iv) detecting the level of expression or activity of the 

1 0 CARD-3, CARD-4. CARD-5, or CARD-6 protein, mRNA, or genomic DNA in 
the post-administration samples; (v) comparing the level of expression or activity 
of the CARD-3, CARD-4. CARD-5. or CARD-6 protein. mRNA. or genomic 
DNA in the pre-administration sample with the CARD-3. CARD-4. CARD-5. or - 
CARD-6 protein. mRNA, or genomic DNA in the post administration sample or 

1 5 samples; and (vi) altering the administration of the agent to the subject 

accordingly. For example, increased administration of the agent may be desirable 
to increase the expression or activity of CARD-3. CARD-4, CARD-5. or CARD- 
6 to higher levels than detected, i.e.. to increase the effectiveness of the agent. 
Alternatively, decreased administration of the agent may be desirable to decrease 

2 0 expression or activity of CARD-3. CARD-4. CARD-5, or CARD-6 to lower 
levels than detected, i.e.. to decrease the effectiveness of the agent. 
5. Transcriptional Profiling 

The CARD-3. CARD-4. CARD-5. and CARD-6 nucleic acid 
molecules described herein, including small oligonucleotides, can be used in 

2 5 transcriptionally profiling. For example, these nucleic acids can be used to 

examine the expression of CARD-3. CARD-4, CARD-5. and CARD-6 in normal 
tissue or cells and in tissue or cells subject to a disease state, e.g.. tissue or cells 
derived from a patient having a disease of interest or cultured cells which model 
or retlect a disease state of interest, e.g.. cells of a cultured tumor cell line. By 

3 0 measuring expression of CARD-3. CARD-4. CARD-5. and CARD-6. together or 
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individually, a profile of expression in normal and disease states can be 
developed. This profile can be used diagnostically and to examine the 
effectiveness of a therapeutic regime. 

5 C Methods of Treatment 

The present invention provides for both prophylactic and therapeutic 
methods of treating a subject at risk of (or susceptible to) a disorder or having a 
disorder associated with aberrant CARD-3. CARD-4, CARD-5, or CARD-6 
expression or activity, examples of which are provided herein. 

10 1 . Prophylactic Methods 

In one aspect, the invention provides a method for preventing in a 
subject, a disease or condition associated with an aberrant CARD-3. CARD-4. 
CARD-5. or CARD-6 expression or activity, by administering to the subject an 
agent which modulates CARD-3. CARD-4. CARD-5, or CARD-6 expression or 

15 at least one CARD-3, CARD-4, CARD-5. or CARD-6 activity. Subjects at risk 
for a disease which is caused or contributed to by aberrant CARD-3. CARD-4. 
CARD-5. or CARD-6 expression or activity can be identified by. for example, 
any or a combination of diagnostic or prognostic assays as described herein. 
Administration of a prophylactic agent can occur prior to the manifestation of 

2 0 symptoms characteristic of the CARD-3. CARD-4. CARD-5. or CARD-6 

aberrancy, such that a disease or disorder is prevented or. alternatively, delayed in 
its progression. Depending on the type of CARD-3. CARD-4. CARD-5. or 
CARD-6 aberrancy, for example, a CARD-3. CARD-4. CARD-5. or CARD-6 
agonist or CARD-3. CARD-4, CARD-5. or CARD-6 antagonist agent can be 

2 5 used for treating the subject. The appropriate agent can be determined based on 
screening assays described herein. Activities of CARD-3. CARD-4. CARD-5. or 
CARD-6 that could be modulated for prophylactic purposes include, but are not 
limited to: 1 ) CARD-3. CARD-4. CARD-5. or CARD-6 gene or protein 
expression, for example, sec Example 1 1 for a description of the mRNA 
SO expression pattern of human CARD-4: 2)CARD-3. CARD-4. CARD-5. or 
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CARD-6 binding to a target protein, for example, see Examples 7. 8, and 1 2 for a 
description of proteins known to bind to CARD-3 or CARD-4; 3 ) CARD-4 
regulation of NF-kB as described in Example 9; and 4) CARD-3 and CARD-4 
enhancement of caspase 9 activity as described in Example 10. 
5 2. Therapeutic Methods 

Another aspect of the invention pertains to methods of modulating 
CARD-3. CARD-4, CARD-5, or CARD-6 expression or activity for therapeutic 
purposes. The modulatory method of the invention involves contacting a cell 
with an agent that modulates one or more of the activities of CARD-3. CARD-4. 

10 CARD-5, or CARD-6 protein activity associated with the cell. An auent that 
modulates CARD-3, CARD-4. CARD-5. or CARD-6 protein activity can be an 
agent as described herein, such as a nucleic acid or a protein, a naturally- 
occurring cognate ligand of a CARD-3. CARD-4. CARD-5. or CARD-6 protein, 
a peptide, a CARD-3, CARD-4. CARD-5. or CARD-6 peptidomimetic, or other 

15 small molecule. In one embodiment, the agent stimulates one or more of the 
biological activities of CARD-3. CARD-4. CARD-5, or CARD-6 protein. 
Examples of such stimulatory agents include active CARD-3. CARD-4. CARD- 
5, or CARD-6 protein and a nucleic acid molecule encoding CARD-3. CARD-4. 
CARD-5. or CARD-6 thai has been introduced into the cell. In another 

2 0 embodiment, the agent inhibits one or more of the biological activities of C ARD- 
3, CARD-4. CARD-5. or CARD-6 protein. Examples of such inhibitor}' agents 
include antisense CARD-3. CARD-4. CARD-5. or CARD-6 nucleic acid 
molecules and anti-CARD-3. CARD-4. CARD-5. or CARD-6 antibodies. These 
modulatory methods can be performed in vitro (e.g.. by culturing the cell with the 

2 5 agent) or. alternatively, in vivo (e.g. by administering the agent to a subject). As 

such, the present invention provides methods of treating an individual afflicted 
with a disease or disorder characterized by aberrant expression or activity of a 
CARD-3. CARD-4. CARD-5. or CARD-6 protein or nucleic acid molecule or a 
disorder related to CARD-3, CARD-4. CARD-5 or CARD-6 expression or 

3 0 activity. In one embodiment, the method involves administering an agent (e.u.. 
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an agent identified by a screening assay described herein), or combination of 
agents that modulates (e.g., upregulates or downregulates) CARD-3, CARD-4. 
CARD-5, or CARD-6 expression or activity. In another embodiment, the method 
involves administering a CARD-3. CARD-4, CARD-5. or CARD-6 protein or 
5 nucleic acid molecule as therapy to compensate for reduced or aberrant CARD-3, 
CARD-4, CARD-5. or CARD-6 expression or activity. Activities of CARD-3. 
CARD-4, CARD-5, or CARD-6 that could be modulated for therapeutic purposes 
include, but are not limited to, 1 ) CARD-3, CARD-4, CARD-5. or CARD-6 gene 
or protein expression, for example, see Example 1 1 for a description of the 

10 mRNA expression pattern of human CARD-4; 2) CARD-3. CARD-4. CARD-5. 
or CARD-6 binding to a target protein, for example, see Examples 7. 8. and 12 
for a description of proteins known to bind to CARD-3 or CARD-4; 3) CARD-4 
regulation of NF-kB as described in Example 9; and 4) CARD-4 enhancement of 
caspase 9 activity as described in Example 10. 

15 Stimulation of CARD-3, CARD-4. CARD-5. or CARD-6 activity is 

desirable in situations in which CARD-3. CARD-4. CARD-5. or CARD-6 is 
abnormally downregulated and/or in which increased CARD-3, CARD-4. 
CARD-5, or CARD-6 activity is likely to have a beneficial effect. Conversely, 
inhibition of CARD-3. CARD-4, CARD-5. or CARD-6 activity is desirable in 

2 0 situations in which CARD-3. CARD-4, CARD-5. or CARD-6 is abnormally 
upregulated. e.g., in myocardial infarction, and/or in which decreased CARD-3. 
CARD-4. CARD-5. or CARD-6 activity is likely to have a beneficial effect. 
Since CARD-4 may play be involved in the processing of cytokines, inhibiting 
the activity or expression CARD-4 may be beneficial in patients that have 

2 5 aberrant inflammation. 

This invention is further illustrated by the following examples which 
should not be construed as limiting. The contents of all references, patents and 
published patent applications cited throughout this application are hereby 
incorporated by reference. 
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EXAMPLES 

Example 1 ; Isolation and Characterization of full- length Human 
CARD-3 and CARD-4L/S cDNAs. 

A profile of known CARD domains was used to search databases of 
5 cDNA sequences and partial cDNA sequences using TBLASTN (Washington 
University; version 2.0, BLOSUM62 search matix). This search led to the 
identification of CARD-3. Using CARD-3 to search databases of cDNA 
sequences and partial cDNA sequences, another potential CARD cDNA was 
found. This cDNA sequence was used screen a human umbilical vein endothelial 
10 library (HUVE) and a clone containing the partial CARD-4S was identified. The 
human umbilical vein endothelial library was then rescreened using a probe 
designed against the partial CARD-4S sequence and a clone containing the 
CARD-4L sequence was identified. 

15 Example 2: Characterization of CARD-3 AND CARD-4L/S Proteins. 

In this example, the predicted amino acid sequences of human CARD- 
3 and CARD-4L/S proteins were compared to amino acid sequences of known 
proteins and various motifs were identified. For example, the CARD domains of 
CARD-3 and CARD-4 were aligned (Figure 7) with the CARD domains of ARC- 

20 CARD (SEQ ID NO:3 1 ). clAPl-CARD (SEQ ID NO:32) and clAP2-CARD 
(SEQ ID NO:33). In addition, the molecular weight of the human CARD-3 and 
CARD-4L/S proteins were predicted. 

The human CARD-3 cDNA was isolated as described above (Figure 
1 : SEQ ID NO: I ) and encodes a 540 amino acid protein (Figure 2: SEQ ID 

2 5 NO:2). CARD-3 also includes one predicted kinase domain (amino acid 1 to 

amino acid 300 of SEQ ID NO:2; SEQ ID NO:4). which is followed by a 
predicted linker domain (amino acid 301 10 amino acid 431 of SEQ ID NO:2: 
SEQ ID NO:5) and a predicted CARD domain (amino acid 432 to amino acid 
540 of SEQ ID NO:2: SEQ ID NO:6>. 

3 G The human CARD-4L cDN A was isolated as described above ( Figure 
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3; SEQ ID NO:7) and has a 2859 nucleotide open reading frame (nucleotides 
245-3103 of SEQ ID NO:7; SEQ ID NO:9) which encodes a 953 ammo acid 
protein (Figure 4; SEQ ID NO:8). CARD-4L protein has a predicted CARD 
domain (amino acids 15-1 14; SEQ ID NO: 10). CARD-4L is also predicted to 
have a nucleotide binding domain which extends from about amino acid 198 to 
about amino acid 397 of SEQ ID NO:8; SEQ ID NO:l 1, a predicted Walker Box 
"A", which extends from about amino acid 202 to.about amino acid 209 of SEQ 
ID NO:8; SEQ ID NO: 12, a predicted Walker Box "B", which extends from 
about amino acid 280 to about amino acid 284. of SEQ ID NO:8: SEQ ID NO: 13. 
a predicted kinase la (P-loop) domain, which extends from about amino acid 197 
to about amino acid 212 of SEQ ID NO:8; SEQ ID NO:46. a predicted kinase 2 
domain, which extends from about amino acid 273 to about amino acid 288 of 
SEQ ID NO:8; SEQ ID NO:47. a predicted kinase 3a subdomain. which extends 
from about amino acid 327 to about amino acid 338 of SEQ ID NO:8; SEQ ID 
NO: 14. ten predicted Leucine-rich repeats which extend from about amino acid 
674 to about amino acid 950 of SEQ ID NO:8. The first Leucine-nch repeat is 
predicted to extend from about amino acid 674 to about amino acid 701 of SEQ 
ID NO:8: SEQ ID NO: 15. The second Leucine-rich repeat is predicted to extend 
from about amino acid 702 to about amino acid 727 of SEQ ID NO:8; SEQ ID 
NO: 16. The third Leucine-rich repeat is predicted to extend from about amino 
acid 728 to about amino acid 754 of SEQ ID NO:8; SEQ ID NO: 1 7. The fourth 
Leucine-rich repeat is predicted to extend from about amino acid 755 to about 
amino acid 7S2 of SEQ ID NO:8: SEQ ID NO: 1 8. The fifth Leucine-rich repeat 
is predicted to extend from about amino acid 783 to about amino acid 810 of SEQ 
ID NO:8: SEQ ID NO: 19. The sixth Leucine-rich repeat is predicted to extend 
from about amino acid 81 1 to about amino acid 838 of SEQ ID NO:8: SEQ ID 
NO:20. The seventh Leucine-rich repeat is predicted to extend from about amino 
acid 839 to about amino acid 866 of SEQ ID NO:S: SEQ ID NO:21 . The eighth 
Leucine-rich repeat is predicted to extend from about amino acid 867 to about 
amino acid 894 of SEQ ID NO:8: SEQ ID NO:22. The ninth Leucine-rich repeat 
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is predicted to extend from about amino acid 895 to about amino acid 922 of 
SEQ ID NO:8; SEQ ID NO:23 and the tenth Leucine-rich repeat is predicted to 
extend from about amino acid 923 to about amino acid 950 of SEQ ID NO:8: 
SEQ ID NO:24. 

5 The human partial CARD-4S cDNA isolated as described above 

(Figure 5; SEQ ID NO:25) encodes a 490 amino acid protein (Figure 6: SEQ ID 
NO:26). CARD-4S includes one predicted partial CARD domain (amino acids 1 - 
74 of SEQ ID NO:26). CARD-4S is also predicted to have a P-Loop which 
extends from about amino acid 163 to about amino acid 170 of SEQ ID NO:26; 

1 0 SEQ ID NJO:29. and a predicted Walker Box "B" which extends form about 
amino acid 241 to about amino acid 245 of SEQ ID NO:26: SEQ ID NO:30. 

A plot showing the predicted structural features of CARD-4L is 
presented in Figure 8. This figure shows the predicted alpha regions (Gamier- 
Robinson and Chou-Fasman), the predicted beta regions (Garnier-Robinson and 

15 Chou-Fasman), the predicted turn regions (Garnier-Robinson and Chou-Fasman) 
and the predicted coil regions (Garnier-Robinson and Chou-Fasman). Also 
included in the figure is a hydrophilicity plot (Kyte-Doolittle). the predicted alpha 
and beta-amphatic regions (Eisenberg). the predicted flexible regions (Karplus- 
Schulz), the predicted antigenic index (Jameson- Wolf) and the predicted surface 

2 0 probability plot (Emini). 

A plot showing the predicted sturctural features of CARD-4S is also 
presented in Figure 9. This figure shows the predicted alpha regions (Garnier- 
Robinson and Chou-Fasman). the predicted beta regions (Garnier-Robinson and 
Chou-Fasman). the predicted turn regions (Garnier-Robinson and Chou-Fasman) 

2 5 and the predicted coil regions (Garnier-Robinson and Chou-Fasman). Also 

included in the figure is a hydrophilicity plot (Kyte-Doolittle). the predicted alpha 
and beta-amphatic regions (Eisenberg). the predicted flexible regions (Karplus- 
Schulz). the predicted antigenic index (Jameson- Wolf) and the predicted surface 
probability plot (Emini). 

3C 
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The predicted MW of CARD-3 is approximately 61 kDa. The 
predicted MW of CARD-4L is approximately 108 kDa. 

Example 3: Preparation of CARD-3 and CARD-4 Proteins 
5 Recombinant CARD-3 and CARD-4 can be produced in a variety of 

expression systems. For example, the CARD-3 and CARD-4 peptides can be 
expressed as a recombinant glutathione-S-transferase (GST) fusion protein in E. 
coli and the fusion- protein can be isolated and characterized. Specifically, as 
described above. CARD-3 or CARD-4 can be fused to GST and the fusion 

1 0 protein can be expressed in E. coli strain PEB199. Expression of the GST- 

CARD-3 or GST-CARD-4 fusion protein in PEB199 can be induced withlPTG. 
The recombinant fusion protein can be purified from crude bacterial lysates of the 
induced PEB199 strain by affinity chromatography on glutathione beads. 

15 Example 4: Identification of splice variants of CARD-4. 

The 5' untranslated sequence from CARD-4L was used to search 
databases of cDNA sequences and partial cDNA sequences using BLASTN 
(Washington University; version 2.0, BLOSUM62 search matrix) for additional 
CARD-4 cDNA clones. This search led to the identification of two cDNA 

2 0 clones, clone Z from a human lymph node library and the Y clone from a human 

brain cDNA library. Both clones were sequenced and found to represent 
probable splice variants of CARD-4 that encode truncated CARD-4 proteins. Y 
encoding a 249 amino acid protein and Z encoding a 164 amino acid protein. 
Fig. 10 shows the nucleotide (SEQ ID NO:38) and Fig. 1 1 the predicted amino 

2 5 acid (SEQ ID NO:39) sequences of human CARD-4 Y: Fig. 12 shows the 

nucleotide (SEQ ID NO:40) and Fig. I 3 the amino acid (SEQ ID NO:4 1 ) 
sequences of human CARD-4Z: and Fig. 14 shows an alignment of the CARD- 
4L. CARD-4Y, and CARD-4Z amino acid sequences generated by the Clustal 
program using a PAM250 residue weight table. 

3 0 
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Example 5: Identification of murine CARD-4. 

The CARD-4 polypeptide sequence was used to search databases of 
cDNA sequences and partial cDNA sequences using the TBLASTN program 
(version 1 .4, BLOSUM62 search matrix, and a word length of 3) for murine 
5 CARD-4 cDN A clones. This search led to the identification of a partial murine 
CARD-4 clone designated murine CARD-4L. The rapid identification of cDNA 
ends procedure (RACE) was applied to the 5' end of the murine CARD-4L clone 
lo elucidate the 5' end of the murine CARD-4L cDN A. Fig. 1 5 shows the murine 
CARD-4L nucleotide sequence (SEQ ID NO:42). Figure 16 shows the murine 
1 0 CARD-4L amino acid sequence (SEQ ID NO:43), and Fig. 1 7 shows an 

alignment of the murine CARD-4L and human CARD-4L amino acid sequences 
generated by the Clustal program using a PAM250 residue weight table. 

Example 6: Identification of the chromosomal location of human CARD-4. 

15 To determine the chromosomal location of the human CARD-4 gene, 

the polymerase chain reaction carried out with human CARD-4-specific primers 
card4t, with the 5' to 3' sequence agaaggtctggtcggcaaa (SEQ ID NO:44). and 
card4k. with the 5' to 3' sequence aagccctgagtggaagca (SEQ ID NO:45). was used 
to screen DNAs from a commercially available somatic cell hybrid panel. This 

2 0 analysis showed that human CARD-4 maps lo chromosome 7 close to the SHGC- 
31928 genetic marker. 

Example 7: Identification of CARD-3 in a yeast two-hybrid screen for proteins 
that physically interact with the CARD domain of human CARD-4. 

2 5 DNA encoding amino acids 1-145 of human CARD-4 comprising the 

CARD domain was cloned into a yeast two-hybrid screening vector to create a 
CARD-4 J-145-GAL4 DNA-binding domain fusion for two-hybrid screening. 
The CARD-4, 1-145-GAL4 DNA-binding domain fusion was used to screen 
human mammary gland and human prostate two-hybrid libraries for gene 

2 0 products that could physically associate with CARD-4. 1-145. Twelve library 

- 128- 

BNSDOCiC' <WC Ci006?6A2 ; > 



WO 01/00826 



PCT/US00/17691 



plasmids expressing CARD4, 1-145 interacting proteins were found to contain the 
CARD-domain containing protein CARD-3 thus establishing a direct or indirect 
physical interaction between CARD-4 and CARD-3. 

In addition. DNA encoding amino acids 435-540 of CARD-3 
5 comprising the CARD domain of CARD-3 (SEQ ID NO:6) was cloned into a 
yeast two-hybrid GAL4 transcriptional activation domain fusion vector to create 
a CARD-3.435-540-GAL4 transcriptional activation domain fusion. To test 
whether the CARD domain of CARD-3 binds CARD-4, 1-145. the CARD-3,435- 
540-GAL4 transcriptional activation domain fusion expression vector and,the 

1 1 CARD-4. 1-I45-GAL4 DNA-binding domain fusion vector were eolransformed 
into a two-hybrid screening Saccharomyces cerevisiae (yeast) strain. The 
resulting cotransformed yeast strain expressed the two reporter genes that 
indicate a physical interaction between the two hybrid proteins in the experiment, 
in this case, the CARD-3,435-540-GAL4 transcriptional activation domain fusion 

15 protein and the CARD-4.1-145-GAL4 DNA-binding domain fusion protein. This 
experiment established a physical interaction between the CARD domain of 
CARD-3 and the CARD domain of CARD-4. 

Example 8: Identification of hNUDC in a yeast two-hybrid screen for proteins 
2 0 that physically interact with the LRR domain of human CARD-4. 

DNA encoding amino acids 406-953 of human CARD-4L comprising 
the LRR domain was cloned into a yeast two-hybrid screening vector to create a 
CARD-4.406-953-GAL4 DNA-binding domain fusion for two-hybrid screening. 
The CARD-4.406-953-GAL4 DNA-binding domain fusion was used to screen a 
2 5 human mammary gland two-hybrid library for gene products that could 

physically associate with CARD-4.406-953. One library plasmid expressing a 
CARD-4.406-953 interacting protein was found to contain the hNUDC protein, 
the human ortholog of the rat NUDC protein that has been implicated in nuclear 
movement (Moms et aL Curr. Biol. 8:603 f 1998 J. Morris et al.. Exp. Cell Res. 

3C 
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238:23 [1998]), thus establishing a physical interaction between CARD-4 and 
hNUDC. 

Example 9: Discovery of regulation by CARD-4 of NF-kB. 
5 The first group of experiments described in this Example were carried 

out to determine if CARD-4 can activate the NF-kB pathway. CARD-4 
regulation of the NF-kB pathway is of interest because the NF-kB pathway is 
involved in many diseases described in (New England Journal of Medicine 
336:1066 [1997]) and (American Journal of Cardiology 76:I8C [1995J) and other 
10 references known to those skilled in the art. Participation of CARD-4 in the NF- 
kB pathway would make CARD-4 an attractive target for drugs that modulate the 
NF-kB pathway for treatment of NF-kB pathway-dependent diseases, conditions, 
and biological processes. 

The first group of experiments showed specific CARD-4-mediated 
15 NF-kB pathway induction. 

The second group of experiments described in this Example were 
carried out to determine if CARD-3. the NIK serine/threonine protein kinase (Su 
et aL EMBO J. 16:1279 [1997]), or the signal transduction protein TRAF6 (Cao 
et aL Nature 383:443 [1996]). proteins known to participate in the induction of 

2 0 NF-kB (McCarthy et aL J. Biol. Chem. 273:16968 [1998]). are involved in 

transducing the CARD-4-dependent NF-kB pathway induction signal. It was 
found that CARD-3. NIK. and TRAF6 are all involved in transducing the CARD- 
4-mediated NF-kB pathway induction signal. 

In nine transfection experiments. 293T cells coexpressing an NF-kB 
2 5 reporter plasmid and either pCl. pCI-CARD-4L (expressing CARD-4L). pCI- 
CARD-4S {expressing CARD-4S). pCl-APAFL (expressing Apaf-1 ). pCl- 
APAFS (expressing an Apaf-1 variant lacking WD repeats), pCI-CARD- 
4LnoCARD (expressing CARD-4L without a CARD domain). pCl- 
CARD4LnoLRR (expressing CARD-4L without a LRR). pCI- 

3 0 CARD4LCARDonly (expressing CARD-4L CARD domain only), or pCI- 
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CARD4NBSonly (expressing CARD-4L nucleotide binding sequence only) were 
created. 293T cells cells were plated in 6-well plates (35 mm wells) and 
transfected 2 days later (90% confluency) with 1 j^g of NF-kB luciferase reporter 
plasmid (pNF-KB-Luc. Stratagene), 200 ng of pCMV JJ-gal, 600 ng of pCI vector 
5 and 200 ng of indicated expression plasmids using SuperFect transfection reagent 
(Qiagen). For dominant-negative experiments, 2 ng of CARD4 expressing 
plasmid and 800 ng of dominant-negative plasmid were used. Cells were 
harvested 48 h after transfection and luciferase activity in 1000-fold diluted cell 
extracts was determined using the Luciferase Assay System (Promega). In 
10 addition, B-galactosidase activities were determined and used to normalize 
transfection efficiency. 

Relative luciferase activity was determined at the end of the 
experiment to assess NF-kB pathway activation by the gene expressed by the 
pCI-based plasmid in each transfected cell line. The cell lines containing pCL 
1 5 pCI-APAFS. pCI-APAFL ? P CI-CARD-4LnoCARD. and pCI-C ARD4NBSonly 
had similar baseline levels of luciferase expression but the cell lines containing 
pCI-CARD-4L. P CI-CARD4LnoLRR, and pCI-CARD4LCARDonly had 
. luciferase expression about nine fold elevated relative to baseline and the cell line 
containing pCI-CARD4S had luciferase expression sixteen fold elevated relative 
2 0 to baseline. This result demonstrates induction by CARD-4S and CARD-4L of 
the NF-kB pathway. This CARD-4 mediated NF-kB pathway induction is 
dependent on the CARD-4 CARD domain because the pCI-CARD-4noCARD 
construct expressing CARD-4 lacking its CARD domain did not induce the 
luciferase reporter gene and pCl-CARD4LCARDonly expressing the CARD-4 

2 5 CARD domain did induce the luciferase reporter gene. Also, the CARD-4 LRR 

domains are not required for NF-kB pathway activation because pCI- 
CARD4LnoLRR expressing a CARD-4 mutant protein lacking LRR domains is 
able to induce the luciferase reporter gene. In addition, the CARD-4 NBS 
domain is not sufficient for NF-kB pathway activation because pCI- 

3 0 CARD4NBSonly expressing CARD-4 NBS domain is not able to induce the 
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luciferase reporter gene. In addition, the induction of the NF-kB pathway by 
CARD-4 is specific, as neither Apaf-expressing construct in this experiment 
induced luciferase activation. 

In five transfection experiments, 293T cells coexpressing an NF-kB 
5 reporter plasmid (NF-icB-luciferase, Stratagene) and pCI-CARD-4L and either, 
no vector, pCI-TRAF6-DN (expressing a dominant negative version of TRAF-6). 
pCl-NIK-DN (expressing a dominant negative version of NIK kinase), pCI- 
CARD3CARDonly (expressing the CARD domain of CARD-3, which acts as a 
dominant negative version of CARD-3). or pCI-Bcl-XL (expressing the anti- 

1 0 apoptotic protein Bcl-XL) were created. TRAF6-DN. NIK-DN, and CARD3- 
CARDonly are dominant negative alleles of the TRAF6. NIK. and CARD3 
genes, respectively. After 48 hours, cells were lysed and the relative luciferase 
activity was determined (Promega Kit) to assess NF-kB pathway activation by 
the genes expressed by the one or two pCl-based plasmids in each transfected cell 

1 5 line. The cell lines containing pCl-CARD-4L only or pCT-CARD-4L and pCI- 
Bcl-XL had relative luciferase reporter gene expression of about 1 8 units. The 
cell lines containing pCI-CARD-4L and pCI-TRAF6-DN, pCI-CARD-4L and 
pCI-NIK-DN. or pCI-CARD-4L and pCI-CARD3CARDonly had relative 
luciferase reporter gene expression of about 4 units. Inhibition of CARD-4L- 

2 0 mediated NF-kB pathway induction by TRAF6-DN, NIK-DN, and CARD- 
3CARDonly is specific as Bcl-XL did not inhibit CARD-4L-mediated NF-kB 
pathway induction. 

These results demonstrate that dominant negative alleles of TRAF6, 
NIK and CARD-3 expressed, respectively, from pCl-TRAF6-DN, pCl-NIK-DN. 

2 5 and pCI-CARD3CARDonly block induction of the NF-kB reponer gene by 

CARD-4L expression (pCl-CARD-4L) and suggest that TRAF6, NIK. and 
CARD-3 act downstream of CARD-4L to transduce the CARD-4L NF-kB 
pathway induction stimulus. 

in an additional experiment, coexpression of CARD-4 and the CARD 

3 C domain of CARD-3 revealed that the CARD domain of CARD-3 functions as a 
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dominant negative mutant suggesting that CARD-3 is a downstream mediator of 
CARD-4 function. 

Example 10: Discover)' of CARD-4 enhancement of caspase-9 activity. 
5 In ten transfection experiments, 293T cells coexpressing a beta 

galactosidase-expressing plasmid (pCMV P-gal from Stratagene) as a marker for 
viable cells and either pCl, pCI-CARD-3, pCI-APAF. pCl-CARD-4L, pCI- 
CARD-4S, pCI-CARD4LnoLRR, pCI-CARD4NBSonly. pCJ- 
CARD4LCARDonl\\ pCI-CARD-4LnoCARD or pCI-casp9 (expressing caspase- 

10 9) were created. Transfections included 400 ng of pCMV [3-gaL 800 ng of 
expression plasmid. and Superfect transfection reagent from Qiagen and were 
carried out according to the manufacturer's directions. After 40-48 hours, cells 
were fixed and stained for beta-galactosidase expression and cell viability was 
determined by counting the number of beta galactosidase positive cells. 

15 Expression of pCI, pCI-CARD-3, pCI-APAF. pCI-CARD-4L. pCl-CARD-4S, 
pCI-CARD4LnoLRR. pCl-CARD4NBSonly. pCl-CARD4LCARDonly. and pCI- 
CARD-4LnoCARD did not result in loss of cell viability. As expected, 
expression of pCl-casp9 in 293T cells resulted in a loss of viability of about 75% 
of the cells in the experiment. 

2 0 It was next tested whether pCL pCI-CARD-3. pCI-APAF. pCI- 

CARD-4L, pCI-CARD-4S. pCl-CARD4LnoLRR. pCI-CARD4NBSonly. pCI- 
CARD4LCARDonIy. or pCI-CARD-4LnoCARD would regulate caspase 9- 
mediated apoptosis. In nine transfection experiments. 2931" cells coexpressing a 
beta galactosidase-expressing plasmid as a marker for viable cells. pCI-casp9. 

2 5 and either pCL pCI-CARD-3. pCI-APAF. pCl-CARD-4L. pCI-CARD-4S. pCl- 

CARD4LnoLRR. pCI-CARD4NBSonly. pCI-CARD4LCARDonl>\ and pCI- 
CARD-4LnoCARD were created. After 40-48 hours, cells were fixed and 
stained for bcta-galactosidase expression and cell viability was determined by 
counting the number of beta galactosidase positive cells. Expression of pCL pCI- 

3 0 CARD-4LnoCARD. and pCI-CARD4NBSoniy in the caspase 9-expressing 293T 
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cells had no effect on the caspase 9-induced apoptosis. However, pCl-CARD-3, 
pCI-CARD-4L, pCI-CARD-4S, pCI-CARD4LnoLRR, pCI-CARD4LCARDonly 
and. as expected, pCI-APAF enhanced the level of caspase 9-induced apoptosis 
to 20 or less beta galactosidase positive cells per experiment from about 100 beta 
5 glactosidase positive cells per experiment. 

This experiment demonstrated that CARD-4 can enhance caspase 9- 
mediated apoptosis because coexpression of CARD-4L or CARD-4S with 
caspase-9 dramatically increases caspase-9 mediated apoptosis. Furthermore, the 
CARD-4 CARD domain (SEQ ID NO: 10) is necessary and sufficient for CARD- 

1 0 4-mediated enhancement of caspase-9-potentiated apoptosis because C ARD-4L 
lacking its CARD domain (pCI-CARD-4LnoCARD) does. not enhance caspase-9- 
mediated apoptosis while the CARD-4 CARD domain expressed alone (pCI- 
CARD4LCARDorrly) does induce caspase-9 mediated apoptosis. In addition, the 
LRR present in CARD-4 is not required for CARD-4 enhancement of caspase-9- 

15 mediated apoptosis because expression of a CARD-4 protein lacking the LRR 
(pCI-CARD4LnoLRR) still enhances caspase-9-mediated apoptosis. The CARD- 
4 NBS is not sufficient for CARD-4 enhancement of caspase-9-mediated 
apoptosis because expression of the CARD-4 NBS only (pCI-CARD4NBSonly) 
does not enhance caspase-9 mediated apoptosis. This experiment also 

2 0 demonstrates that CARD-3 can enhance caspase-9-mediated apoptosis. 

As detailed below in Example 12. CARD-4 does not appear to interact 
directly with caspase-9, suggesting that potentiation of caspase-9 activity by 
CARD-4 is mediated by activation of downstream pathways. 

2 5 Example 1 1 : Identification and tissue distribution of mRNA species expressed 

by the human CARD-4 gene. 

Northern analysis of mRNAs extracted from adult human tissues 
revealed a 4.6 kilobase mRNA band thai was expressed in most tissues examined. 
Highest expression was observed heart, spleen, placenta and lung. CARD-4 was 

3 0 also observed to be expressed in fetal brain, lung, liver and kidney. Cancer cell 
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lines expressing the 4.6 kilobase CARD-4 mRNA include HeLa, K562, Molt4. 
SW480. A549 and melanoma. A larger 6.5 to 7.0 kilobase CARD-4 mRNA was 
expressed in heart, spleen, lung, fetal lung, fetal liver, and in the Molt4 and 
SW480 cell lines. 

5 

Example 12: Physical association of CARD-4 with CARD-3. 

CARD-4-specifk PCR primers with the 3' primer encoding the HA 
epitope tag were used to amplify the CARD-4 L gene epitope tagged with HA and 
this PCR product was cloned into the mammalian expression vector pCI. CARD- 

10 3-specific PCR primers with the 5' primer encoding the MYC epitope tag were 
used to amplify the CARD-3 gene epitope tagged with MYC and this PCR 
product was cloned into the mammalian expression vector pCI. CARD-3-specific 
PCR primers with the 5' primer encoding the MYC epitope tag were used to 
amplify the CARD-3 gene lacking the CARD domain (SEQ ID NO:6) epitope 

15 tagged with MYC and this PCR product was cloned into the mammalian 
expression vector pCI. Caspase 9-specific PCR primers with the 3' primer 
encoding the MYC epitope tag were used to amplify the caspase 9 gene epitope 
tagged with MYC and this PCR product was cloned into the mammalian 
expression vector pCI. In three transfection experiments. 293T cells 

2 0 coexpressing pCI-CARD-4LcHA and either pCI-CARD3nM YC pCI- 

CARD3noCARDnMYC\ or pCI-casp9cMYC were created. Cells from each 
transfected line were lysed and an immunoprecipitation procedure was carried out 
on each lysate with an anti-M YC epitope tag antibody to precipitate the CARD- 
4LcHA expressed by each cell line and any physically associated proteins. 

2 5 lmmunoprecipitated proteins were separated by electrophoresis on denaturing 
polyacrylamide gels, transferred to nylon filters, and probed with an anti-HA 
epitope tag antibody in a Western blotting experiment to determine whether the 
MYC-tagged protein that was coexpressed with the CARD-4LcHA protein had 
eoimmunoprecipitated with the CARD-4LcHA protein. In this experiment. 

. 3 0 CARD-3 was found to coimmunoprecipitate with CARD-4 while CARD-3 
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. lacking its CARD domain and caspase-9 did not coimmunoprecipitate with 
CARD-4. This experiment demonstrates that CARD-4 and CARD-3 physically 
associate and that CARD-3 requires its CARD domain to associate with CARD- 
4. In addition, CARD-4 appears to not associate with caspase-9. 

5 

Example 13: CARD-4 Genomic Sequence 

Figure 1 8 is depicts the 32042 nucleotide genomic sequence of 
CARD-4 (SEQ ID NO:63). This sequence is based the CARD-4 cDNA sequence 
described above and a BAC sequence (DBEST Accession No. AC006027). The 

1 0 CARD-4 cDN A sequence described above was. used to correct three errors in the 
BAC sequence, including one error resulting in a frameshift. The CARD-4 
genomic sequence of Figure 18 includes the following introns and exons: exon 1 : 
nucleotides 364-685. encoding amino acids 1-67 (start codon at nucleotides 485- 
487); intron 1: nucleotides 686-2094; exon 2: nucleotides 2095-2269. encoding 

15 amino acids 67-126: intron 2: nucleotides 2270-4365; exon 3: nucleotides 366- 
6190. encoding amino acids 126-734; intron 3: nucleotides 6191-9024; exon 4: 
nucleotides 9025-9108. encoding amino acids 734-762: intron 4: nucleotides 
9109-10355: exon 5: nucleotides 10356-10439. encoding amino acids 762-790: 
intron 5: nucleotides 10440-1 1 181; exon 6: nucleotides 1 182- 11 265. encoding 

2 0 amino acids 790-818; intron 6: nucleotides 1 1266-19749; exon 7: nucleotides 

1 9750- 1 9833, encoding amino acids 8 1 8-846; intron 7: nucleotides 1 9834-2 1 324; • 
exon 8: nucleotides 21325-21408, encoding amino acids 846-874: intron 8: 
nucleotides 21409-24226; exon 9: nucleotides 24227-24310. amino acids 874- 
903; intron 9: nucleotides 2431 1-27948; exon 10: nucleotides 27949-28032, 
25 amino acids 903-930; intron 10: nucleotides 28033-31695: exon 1 1 : nucleotides 
31696-32024. encoding amino acids 930-953 (stop codon at nucleotides 31766- 
31768). 

The introns in the CARD-4 genomic sequence contain consensus 
splice donor and acceptor sites (Molecular Cell Biology. Darnell ei aL 

3 0 eds..!996). The-CARD-4 genomic sequence is useful for genetic identification 
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and mapping and identifying mutations, e.g., mutations is splice donor or splice 
acceptor sites. 

Example 14: Isolation and characterization of full-length murine CARD-5 and 

5 human CARD-5 

The amino acid sequence of the CARD domain of RAIDD (amino 
acids 1 to 94) was used to search a proprietary murine cDNA sequence database 
using the BLASTX program with the BLOSUM62 matrix and a protein word 
length of three. This search led to the identification of a murine clone, 
1 0 jtmaaO 1 0ht2, present in a coronary artery smooth muscle cell library. This clone 
encodes a protein designated CARD-5. The 761 nucleotide murine CARDo 
cDNA of SEQ ID NO:60 has a 579 nucleotide open reading frame (SEQ ID 
NO:62) encoding a 193 amino acid protein (SEQ ID NO:61 ). The cDNA and 
protein sequences of murine CARD-5 are shown in Figure 19. 
! 5 [Vlurine CARD-5 is predicted to be an intracellular protein having a 

molecular weight of 21 A kDa prior to post-translational modification. 

Figure 20 depicts a hydropathy plot of murine CARD-5. Relatively 
hydrophobic residues are above the dashed horizontal line, and relatively 
hydrophilic residues are below the dashed horizontal line. The cysteine residues 
2 0 (cvs) and potential N-glycosylation sites (Ngly) are indicated by short vertical 
lines just below the hydropathy trace. 

The murine CARD-5 nucleotide sequence was used to search a 
proprietary database of human cDNA sequences. This search led to the 
identification of a human CARD-5 cDNA clone. jthza027gl 111. present in a 

2 5 testes library. 

The 740 nucleotide murine CARD-5 cDN A of SEQ ID NO:48 has a 
585 nucleotide open reading frame (SEQ ID NO:50) encoding a 195 amino acid 
protein (SEQ ID NO:49). The cDNA and protein sequences of human CARD-5 
arc shown in Figure 21. 
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Human CARD-5 is predicted to be an intracellular protein having a 
molecular weight of 21.6 kDa prior to post-translational modification. 

Figure 22 depicts a hydropathy plot of human CARD-5. Relatively 
hydrophobic residues are above the dashed horizontal line, and relatively 
5 hydrophilic residues are below the dashed horizontal line. The cysteine residues 
(cys) and potential N-glycosylation sites (Ngly) are indicated by short vertical 
lines j ust below the hydropathy trace. 

Figure 23 depicts an alignment of the cDNA sequences of murine 
(SEQ ID NO:60) and human (SEQ. ID NO:48) CARD-5. In this alignment the 
1 0 sequences are 68.2% identical. Figure 24 depicts an alignment of the amino acid 
sequences of murine' (SEQ ID NO:61 ) and human (SEQ ID NO:49) CARD-5. In 
this alignment the sequences are 71.8% identical. 

Both murine and human CARD-5 include a CARD domain. The 
CARD domain of murine CARD-5 extends from amino acid 110 to 1 79 of SEQ 
15 ID NO:61 (SEQ ID NO:57). The CARD domain of human CARD-5 extends 
from amino acid 1 1 1 to 1 8 1 of SEQ ID NO:49 (SEQ ID NO:58). Figure 27 
depicts an alignment of the CARD domains of murine CARD-5 (SEQ ID 
NO:57), human CARD-5 (SEQ ID NO:58), and RAIDD (SEQ ID NO:70). 

2 0 Example 15: Isolation and Characterization of full-length rat CARD-6 and 
human CARD-6 

A generalized CARD domain model was used to search a proprietary 
rat cDNA sequence database. This search led to the identification of a rat cDNA 
clone present in a sciatic nerve cDNA library. This clone encodes a protein 

2 5 desigated CARD-6. The 5252 nucleotide rat CARD-6 cDNA of SEQ ID NO:5 1 

has a 2715 nucleotide open reading frame (SEQ ID NO:53) encoding a 905 
amino acid protein (SEQ ID NO:52"). The cDNA and protein sequences of rat 
CARD-6 are shown in Figure 25. 

Rat CARD-6 is predicted to be an intracellular protein having a 

3 0 molecular weight of 100.2 kDa prior to post-translaiional modification. 
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Figure 26 depicts a hydropathy plot of rat CARD-6. Relatively 
hydrophobic residues are above the dashed horizontal line, and relatively 
hydrophilic residues are below the dashed horizontal line. The cysteine residues 
(cys) and potential N-glycosylation sites (Ngly) are indicated by short vertical 
5 lines just below the hydropathy trace. 

Rat C ARD-6 contains a CARD domain which extends from amino 
acid 1 to amino acid 108 of SEQ IDNO:52 (SEQ ID NO:59). Rat CARD-6 also 
has a proline-rich c-terminus which extends from amino acid 698 to amino acid 
905 of SEQ ID NO:52 (SEQ ID NO:65). This proline-rich domain includes five 
10 putative SH3 binding sites. These binding sites have the sequence PXXP and arc 
located at amino acids 710 to 713 (PAHP). 806 to 809 (PLRP). 819 to 822 
(PIPP). 857 to 860 (PPHP). and 881 to 884 (PSQP) of SEQ ID NO:52. 

The rat CARD-6 cDNA sequence described above was used to search 
a proprietary sequence database. This search led to the identification of a clone 
15 from a human muscle cell library encoding a carboxy-terminal portion of human 
CARD-6. A probe designed based on the sequence of this clone was used to 
screen a human adrenal gland library. This screening led to the identification of a 
clone encoding an amino-terminal portion of human CARD-6. The clone 
encoding an amino terminal portion of human CARD-6 contains a region 
2 0 encoding a CARD domain. This CARD domain-encoding sequence was used to 
screen a proprietary database. This screening led to the identification of a clone. 
JthAb086d02, present in an adrenal gland library, which encodes full length 
human CARD-6. 

The 4244 nucleotide human CARD-6 cDN A of SEQ ID NO:54 has a 

2 5 3111 nucleotide open reading frame (SEQ ID NO:56) encoding a 1037 amino 

acid protein (SEQ ID NO:55). The cDNA and protein sequences of human 
CARD-6 are shown in Figure 28. 

N-glycosylation sites arc present at amino acids 49-52. 4 1 5-4 1 8. and 
812-815 of SEQ ID NO:55. Human CARD-6 contains cAMP and cGMP- 

3 0 dependent protein kinase phosphorylation sites at amino acids 1 5 1 - 1 54 and 42^- 
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432 of SEQ ID NO:55. Protein kinase C phosphorylation sites are present at 
amino acids 34-36,57-59, 135-137. 154-156. 161-163. 298-300, 339-341. 346- 
348, 443-445, 664-666. 693-695, 746-748, 882-884, 905-907 and 951-953 of 
SEQ ID NO:55. Casein kinase II phosphorylation sites are present at amino acids 
5 6-9,28-31,40-43, 112-115, 135-138, 154-157. 278-28 1, 32 1-324. 339-342. 354- 
357, 642-645, 670-673, and 707-710 of SEQ ID NO:55. Tyrosine kinase 
phosphorylation sites are present at amino acids 37-34 and 163-169 of SEQ ID 
NO:55. An ATP/GTP-binding site motif A (P-loop) site is present at amino acids 
775-782 of SEQ ID NO:55. 

1 0 Fi ^ ure - 9 depicts a hydropathy plot of human CARD-6. Relatively 

hydrophobic regions are above the horizontal line, and relatively hydrophilic 
regions are below the horizontal line. Cysteine residues are indicated by short 
vertical lines just below the hydropathy trace. 

Human CARD-6 is predicted to have a molecular weight of 1 1 6.5 kD 

15 before post-translational modification. 

Human CARD-6 includes a CARD domain at amino acids 5-92 of 
SEQ ID NO:55 (SEQ ID NO:64). Figure 30 depicts an alignment of the CARD 
domain domain of human CARD-6 and a consensus CARD domain derived from 
a hidden Markov model f SEQ ID NO:67). 

2 0 Northern blot analysis of rat CARD-6 expression revealed that 

CARD-6 is expressed at a high level in the heart (6.5 kb transcript and a 7 kb 
transcript). This analysis also revealed that human CARD-6 is expressed in the 
brain, spleen, lung, liver, muscle, and kidney. 

2 5 Example ] 6: CARD-6 increases intracellular signaling. 

The studies described in this Example demonstrate that CARD-6 
expression can increase intracellular signaling. 

In a first study, a vector which expresses rat CARD-6 under the • 
control of a CMV promoter was transiently transfected into 293 cells along with 

3 0 pNFxp-Luc (Stratagene Inc.. LaJolia, CA). The pNFKp-Luc vector is a reporter 
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plasmid in which a luciferase gene is under the control of a promoter which 
includes a TATA box and five NFk(3 binding elements. Cotramsfection of the ral 
CARD-6 expression vector increased luciferase expression by pNFicP-Luc 18- 
fbld over that observed in the absence of the rat CARD-6 expression vector. This 
5 result indicates that CARD-6 stimulates a signaling pathway involving NF-Kp. 

In a second study, a vector expressing CARD-6 under the control of 
the CMV promoter was transiently transfected into 293 cells along with pAP-1 - 
Luc (Strategene, Inc.). The pAP-l-Luc vector is a reported plasmid in which a 
luciferase gene is under the control of a promoter which includes a TATA box 

1 0 and seven AP-1 binding sites. Co-transfection of the rat CARD-6 expression 

vector increased luciferase expression by pAP-l-Luc 4-fold over that observed in 
the absence of the rat CARD-6 expression vector. This result indicates that 
CARD-6 stimulates a signaling pathway involving AP-L 

Additional studies suggest that CARD-6 can stimulate 
15 phosphorylation of CHOP (GADD153), possibly by activating the stress 
activated kinase. JNK/p38. 

Example 17: Deposit of Clones. 

A plasmid containing a cDNA encoding human CARD-3 (pXEI 7A) 

2 0 was deposited with the American Type Culture Collection (ATCC). Manasass. 

VA on May 14. 1998. and assigned Accession Number 203037. 

A plasmid containing a cDNA encoding human CARD-4L (pC4Ll ) 
was deposited with the American Type Culture Collection (ATCC). Manasass, 
VA on July 7. 1998. and assigned Accession Number 203035. 

2 5 A plasmid containing a cDNA encoding human CARD-4S (pDB33E) 

was deposited with the American Type Culture Collection (ATCC). Manasass, 
VA on May 14. 1998, and assigned Accession Number 203036. . 

A plasmid containing a cDNA encoding murine CARD-5 
(F-pMC5) was deposited with the American Type Culture Collection (ATCC). 

3 0 Manasass. VA on June 1 L 1999. and assigned Accession Number PTA-212. 
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A plasmid containing a cDNA encoding rat CARD-6 (EpRC6) was 
deposited with the American Type Culture Collection (ATCC), Manassas. VA. 
on June IK 1999, and assigned Accession Number PTA-21 1. 

A clone (EpHC5) containing a cDNA molecule encoding human 
5 CARD-5, a clone (EpCH6e) containing a cDNA molecule encoding an amino 
terminal portion of human CARD-6. a clone (EpHC6c) containing a cDNA 
molecule encoding a carboxy terminal portion of human CARD-6, and a clone 
(EpHC6) containing a cDNA molecule encoding human CARD-6 were deposited 
with the American Type Culture Collection (ATCC) Manassas, VA on June 1 1 . 

10 1999. as a composite deposit and assigned Accession Number PTA-21 3. To 

distinguish the strains and isolate a strain harboring a particular cDNA clone, one 
can first streak out an aliquot of the mixture to single colonies on nutrient 
medium (e.g.. LB plates) supplemented with 100 jig/ml ampicillin. grow single 
colonies, and then extract the plasmid DNA from a selected colony using a 

1 5 standard minipreparation procedure. Next, one can digest a sample of the DNA 
minipreparation with a combination of the restriction enzymes Sal I and Not 1 and 
resolve the resultant products on a 0.8% agarose gel using standard DNA 
electrophoresis conditions. The digestion will liberate DNA fragments as 
follows: 



20 



Human CARD-5 (EpHC5) 



0.6 kb and 3.0 kb 



Human CARD-6 amino- 
terminal portion (EpHC6e) 
(amino acids 1-279) 



1.0 kb and 4.3 kb 



25 



Human CARD-6 carboxy 
terminal portion (EpHC6c) 
(amino acid 93-1037) 



3.8 kb and 3.0 kb 



30 



Human CARD-6 (EpHC6) 
(amino acids 1-1037) 



4.2kband3.0kb 
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Equivalents 

Those skilled in the an will recognize, or be able to ascertain using no 
more than routine experimentation, many equivalents to the specific 
embodiments of the invention described herein. Such equivalents are intended to 
5 be encompassed by the following claims. 



What is claimed is: 
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1 . An isolated nucleic acid molecule selected from the group 

consisting of: 

a) a nucleic acid molecule comprising a nucleotide sequence 
which is at least 55% identical to the nucleotide sequence of SEQ ID NO: 1, 3, 7, 

5 9. 25. 27, 38, 40. 42, 48, 50, 51, 53, 54, 56, 60, 62, or the cDNA insert of the 

plasmid deposited with the- ATCC as any of Accession Numbers 

or a complement thereof; 

b) a nucleic acid molecule comprising a fragment of at least 300 
nucleotides of the nucleotide sequence of SEQ ID NO:l, 3, 7, 9. 25. 27. 38, 40, 

1 0 42. 48, 50. 51. 53, 54, 56, 60, 62, or the cDNA insert of the plasmid deposited 

with the ATCC as any of Accession Numbers . or a complement 

thereof: 

c) a nucleic acid molecule which encodes a polypeptide 
comprising the amino acid sequence of SEQ ID NO:2, 8, 26. 39, 41, 43. 49, 52, 

15 55, 6 1 . or amino acid sequence encoded by the cDNA insert of the plasmid 
deposited with the ATCC as any of Accession Numbers ; 

d) a nucleic acid molecule which encodes a fragment of a 
polypeptide comprising the amino acid sequence of SEQ ID NO:2, 8. 26. 39. 41. 
43. 49. 52. 55. 61. or the polypeptide encoded by the cDNA insert of the plasmid 

2 0 deposited with the ATCC as any of Accession Numbers , 

wherein the fragment comprises at least 15 contiguous amino acids of SEQ ID 
NO:2. 8. 26. 39, 41 . 43. 49, 52, 55, 61 . or the polypeptide encoded by the cDNA 
insert of the plasmid deposited with the ATCC as any of Accession Numbers 
; and 

2 5 e) a nucleic acid molecule which encodes a naturally occurring 
allelic variant of a polypeptide comprising the amino acid sequence of SEQ ID 
NO:2, 8. 26. 39. 41, 43, 49. 52. 55. 61. or the amino acid sequence encoded by 
the cDNA insert of the plasmid deposited with the ATCC as any of Accession 
Numbers . wherein the nucleic acid molecule hybridizes to a 
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nucleic acid molecule comprising SEQ ID NO:L 3, 7, 9, 25. 27, 38. 40, 42. 48, 
50, 5 1 , 53, 54, 56, 60. 62, or a complement thereof under stringent conditions. 

2. The isolated nucleic acid molecule of claim 1, which is 
selected from the group consisting of: 

a) a nucleic acid comprising the nucleotide sequence of SEQ ID 
NO: 1,3,7, 9, 25, 27, 38, 40, 42, 48, 50, 5 1 . 53, 54, 56, 60, 62. or the cDNA 
insert of the plasmid deposited with the ATCC as any of Accession Numbers 
. or a complement thereof; and 

b) a nucleic acid molecule which encodes a polypeptide 
comprising the amino acid sequence of SEQ ID NO:2, 8. 26, 39, 41, 43, 49, 52, 
55, 61 . or the amino acid sequence encoded by the cDNA insert of the plasmid 
deposited with the ATCC as any of Accession Numbers . 

15 3. The nucleic acid molecule of claim 1 further comprising vector 

nucleic acid sequences. 

4. The nucleic acid molecule of claim 1 further comprising 
nucleic acid sequences encoding a heterologous polypeptide. 

20 

5. A host cell which contains the nucleic acid molecule of claim 1 . 

6. The host cell of claim 5 which is a mammalian host cell. 

2 5 ... 7. A non-human mammalian host cell containing the nucleic acid 

molecule of claim 1. 
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8. An isolated polypeptide selected from the group consisting of: 

a) a fragment of a polypeptide comprising the amino acid 
sequence of SEQ ID NO:2, 8, 26, 39, 41, 43, 49, 52, 55, or 61, wherein the 
fragment comprises at least 15 contiguous amino acids of SEQ ID NO:2, 8. 26, 

5 39. 41,43,49.52,55, .or61; 

b) a naturally occurring allelic variant of a polypeptide 
comprising the amino acid sequence of SEQ ID NO:2. 8, 26, 39. 41. 43, 49. 52. 
55. or 61, or the amino acid sequence encoded by the cDNA insert of the plasmid 
deposited with the ATCC as any of Accession Numbers . 

1 0 wherein the polypeptide is encoded by a nucleic acid molecule which hybridizes 
to a nucleic acid molecule comprising SEQ ID NO: 1. 3, 7, 9, 25. 27. 38. 40. 42. 
48. 50. 51. 53. 54. 56, 60, 62. or a complement thereof under stringent conditions: 
and 

c) a polypeptide which is encoded by a nucleic acid molecule 

1 5 comprising a nucleotide sequence which is at least 65% identical to a nucleic acid 
comprising the nucleotide sequence of SEQ ID NO:l, 3, 7, 9. 25, 27. 38. 40. 42, 
48. 50, 5 1 . 53, 54. 56. 60, 62, or a complement thereof. 

9. The isolated polypeptide of claim 8 comprising the amino acid 
20 sequence of SEQ ID NO:2. 8.26,39,41,43.49, 52, 55. or 61 . 

10. The polypeptide of claim 8 further comprising heterologous 
amino acid sequences. 

25 1 1 . An antibody which selectively binds to a polypeptide of claim 8. 

12. A method for producing a polypeptide selected from the group 
consisting of: 

a) a polypeptide comprising the amino acid sequence of SEQ ID 
3 0 NO:2. 8. 26. 39.41. 43. 49, 52. 55, 61. or the amino acid sequence encoded b\ 
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the cDNA insert of the plasmid deposited with the ATCC as any of Accession 

Numbers ; 

b) a polypeptide comprising a fragment of the amino acid 
sequence of SEQ ID NO:2, 8, 26, 39, 41, 43, 49, 52, 55, 6L or the amino acid 
5 sequence encoded by the cDNA insert of the plasmid deposited with the ATCC 

as any of Accession Number __ , wherein the fragment comprises at 

least 1 5 contiguous amino acids of SEQ ID NO:2, 8, 26, 39, 4 1 , 43, 49, 52, 55, 
61. or the amino acid sequence encoded by the cDNA insert of the plasmid 
deposited with the ATCC as any of Accession Numbers : and 

10 c) a naturally occurring allelic variant of a polypeptide 

comprising the amino acid sequence of SEQ ID NO:2. 8, 26. 39, 4 1, 43. 49. 52. 
55, 61. or the amino acid sequence encoded by the cDNA insert of the plasmid 

deposited with the ATCC as any of Accession Numbers , 

wherein the polypeptide is encoded by a nucleic acid molecule which hybridizes 

15 to a nucleic acid molecule comprising SEQ ID NO:1. 3, 7, 9, 25. 27, 38, 40, 42, 
48, 50. 51, 53. 54, 56, 60, 62, or a complement thereof under stringent conditions; 

comprising culturing the host cell of claim 5 under conditions in 
which the nucleic acid molecule is expressed. 

2 0 1 3. A method for detecting the presence of a polypeptide of claim 

8 in a sample, comprising: 

a) contacting the sample with a compound which selectively 
binds to a polypeptide of" claim 8; and 

b) determining whether the compound binds to the polypeptide in 

2 5 the sample. 

14. The method of claim 13, wherein the compound which binds 
to the polypeptide is an antibody. 



30 
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15. A kit comprising a compound which selectively binds to a 
polypeptide of claim 8 and instructions for use. 

1 6. A method for detecting the presence of a nucleic acid molecule 
5 of claim 1 in a sample, comprising the steps of: 

a) contacting the sample with a nucleic acid probe or primer 
which selectively hybridizes to the nucleic acid molecule; and 

b) determining whether the nucleic acid probe or primer binds to 
a nucleic acid molecule in the sample. 

10 

1 7. The method of claim 16, wherein the sample comprises 
mRNA molecules and is contacted with a nucleic acid probe. 

1 8. A kit comprising a compound which selectively hybridizes to a 
15 nucleic acid molecule of claim 1 and instructions for use. 

1 9. A method for identifying a compound which binds to a 
polypeptide of claim 8 comprising the steps of: 

a) • contacting a polypeptide, or a cell expressing a polypeptide of 
2 C claim 8 with a test compound: and 

b) determining whether the polypeptide binds to the test 

compound. 

20. The method of claim 1 9. wherein the binding of the test 

2 5 compound to the polypeptide is detected by a method selected from the group 
consisting of: 

a) detection of binding by direct detecting of test 
compound/polypeptide binding; 

b) detection of binding using a competition binding assay: 

30 
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c) detection of binding using an assay for CARD-3, CARD-4, 
CARD-5. or CARD-6-mediated signal transduction. 

21. A method for modulating the activity of a polypeptide of claim 
5 8 comprising contacting a polypeptide or a cell expressing a polypeptide of claim 

8 with a compound which binds to the polypeptide in a sufficient concentration to 
modulate the activity of the polypeptide. 

22. A method for identifying a compound which modulates the 
1 0 activity of a polypeptide of claim 8. comprising: 

a) contacting a polypeptide of claim 8 with a test compound; and 

b) determining the effect of the test compound on the activity of 
the polypeptide to thereby identify a compound which modulates the activity of 
the polypeptide. 

15 
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-VTGGG AA7CC7AGC7 77-3AAGAGAC AGAHCAA7T7 7AGAAW7AAA 77GXAA77GA 

-GA777AAC7 AATG77G777 7AAAATATTG 7AAC777AAA GAATGA7G7C AGAAC77/7AA 
-AGGGXC777 GCAGAG7AGC AGSGG7777G GAGGGCGCGG C77GAA7C77 GATTGCC777 
Z7GC7GAGAG GACACA7G7A GC7GAAGA7G AA77TGGGAA AAGTAGC7G7 773CTA7777 
AAC7A7GGAA GAGCAGGGC7 AZAGTGAGA7 GGAAA7AATC 77ATCAGAG7 777ACC777A 



CA7T7AAT7A C7GAAAAGCA A7CGGGAAC7 777SGTCACT 7ACATC7GCA ATACTCAG7G 
T77GG7GGAC AACT7G77GA AGAA7GAC7A 777777GGCC GAAGA7G7GG AGA77G7G7G 
7GC7777777 ACCCAGC777 ACAAGG777G CAAAAT7CTG GACC7GG7AC AGAGCAAGGG 
CSAGGAGG7G 777GAGT777 7777C7AC77 GG777AGCAA C7CGCAGA7G C77ACG7GGA 
CC77AGGC77 7GGC7G77CG AGA7CGGC77 =777777777 77GCTCAC77 AGACCAAAG7 
■TG7GG7CAAC AC7GAC77AG 77AGCAGG7A TAC77AGCAG C7G7GACACC A777GGGCCG 
TGAC777AAG 777G7GC7G7 G77A7GC77A GAAGGAGGAG C7GC7G77GG AGGAGA777A 
IATGGACAC7 A7CA7GGAG7 77GT7GG777 GAGCAA7GAG AGC77GGGCA G777GAACAG 
C77GGC77GC C7777GGAC7 ACAC7AC7GG CA77777AAT GAGCAGGG7G AGACCA7777 
CA7C77GGG7 GATGC7GGGG 7GGGCAAG77 7A7G7TGC7A CAGCGGC7GC AGAGC7777G 
GGCCACGGGC CGGC7AGACG CAGGGG77AA A7777777T7 CACT777G7T GCCGCA7G77 
CAGC7G77T7 AAGGAAAG7G ACAGGC7G7G TC7GCAGGAC C7GCT777CA AGCAC7AC7G 
C7AC77AGAG CGGGAC777G AGGAGG7G77 7G7C77777G CTGCG77777 :G7ACC7GGC 
C77777ZAC7 777GA7GGC7 7GGACGAGC7 3CAC77GGAC 77GGAC77GA G77G7G7GC7 

AG777G777A C7777TGG77 77GC7GGCCA AC77G777AG 



TGGGAAG777 C77AAGGGGG C7AGCAAG77 G777ACAGC7 GGCACAGGCA 77GAGG773Z 
GCG77AG777 77GCGGAAGA AGG7G77777 CCGGGGC7T7 777C7CAGCC ACC7GCGCGC 
:7A7G77AGG AGGA7G7777 C7GAGCGG57 C77GCAGGAC CGCC7GC7GA G7CAGC7GGA 



GCCCAACCCC AACC777GCA GC77G7GC77 7G7GCC7777 77C7C77GGA 77ATCT7G7G 
37GC7777AG CACT777G7G C7GC7777GA AGGC77ACCA CAGC7G777G AC7GCACGA7 
GACC77GACA GATG777777 7777GG7CAC 7GAGG777A7 C7GAACAGGA TGCAGCC7AG 
CAGC77GGTG CAGCGGAACA CACGCAGCC7 AGTGGAGACC C77CACGC7G GC7GGGACAC 
T77G7G777G C7GGGGCAGG 7GGC77ACCG GGGCATGGAG AAGAGC7777 77G7C777AC 
C7AGGAGGAG G7GCAGGC77 77GGGC7GCA GGAGAGAGAC A7GCAG77GG GC77777GCG 
GGC777G7GG GAGC7G3GCC T7GGGGG7GA CCASCAS7CC 7A7GAG7777 7G7ACC77AC 
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rrrcc agtgcctgca gggcagtgg* cgggcccggg aagacctctt 
agttcaccaa crrcttcctg tgcgggctgt tgtccaaagc 
:aaacagaaa ctcctgcggc atctggtgcc cgcggcaggc ctcaggagaa agcccaaggc 
rrr— gggca cacctgtttt-"~ ccagcctgcg gggctacctg aaoagcctgc cccgccttca 



^"GAAAGC "GAACCAGG TGCAGGCGA7 GCCCACGTTC A7CTGGATGC TGCGCTGCA7 
"ACGAGACA CAGAGCCAGA AGGTGGGGCA GCTGGCGGCC AGGGGCATCT GCGCCAACTA 
rZZ ACCTACTCCA ACGCCTGCTC GGGCGACTGC AGCGCCCTCT CCTT! 



3CATCAGTTr "CAAGCGGC TGGCCCTAGA CCTAGACAAC AACAATCTGA ACGACTACGG 
CGTGCGGGAG CTGCAGCCCT GCTTCAGCCG CGTCACTGTT CTCAGACTCA GCGTAAACCA 
GATCACTGAC GGTGGGGTAA AGGTGCTAAG CGAAGAGC7G ACCAAATACA AAATTGT G AC 
"AT7TCGGT TTATACAACA AC ZAGAT CAC ZGATGTCGGA GCCAGGTACG TCACCAAAAX 

rrTGGATGAA tgcaaaggcc tcacgcat— taaactggga aaaaacaaaa taacaagtga 

AGGAGGGAAG TATCTCGCCG T3G C T "G AA GAACAGCAAA 7CAATCTCTG AGGTTGGGA7 
37GGGGCAA7 CAAG77GGGG ATGAAGGAGC AAAAGCCTTC GCAGAGGCTG TGCGGAACCA 
CCCCAGC7TG ACCACCC7GA G7GTTGCG7C CAACGGCA7C 7CCACAGAAG GAGGAAAGAG 
CC77GCGAGG GCCCTGCAGC AGAACACGTC 7C7AGAAA7A C7C7GGC7GA CCCAAAATGA 
ACTCAACGAT GAAGTGGCAG AGAGTTTGGC AGAAATGTTG AAAGTCAACC AGACGTTAAA 
CCA7T7A7GG CT7ATCCAGA A7CAGA7CAC AGCTAAGGGG ACTGCCCAGC 73GCAGA7GC 
3TT ACAGAGC AACACTGGCA TAACAGAGAT 7TGCC7AAAX GGAAACC7GA TAAAACCAGA 
3GAGGCGAAA GTGTATGAAG ATGAGAAGCG GATTATCTGT TTGTGAGAGG A7GC777CC7 

3— CATGGGG T7TT7GCCCT GGAGCGTCAG CAGGAAATGC CAG7C7GGGC AGT i ~ 

37CAG7GTGT TAAAGGGGCC 7GCGCAGGCG GGAC7A7CAG GAGTCCACTG C77VCA7GA7 
GCAAGCCAGC T7CC7C7CCA GAAGG7G7GG 7CGGCAAAC7 CCG7AAG7AC C7CCTACAA7 
7C7CCAGAAA AAGAATGTGT C77GCGAGC7 G77C7AG77A CAGTAAATAC ACTGTGAAGA 
3AAAAAAAAA ACGGACGCGT GG (SEQ ID NO: 7) 
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:-ZEC3HSi:-!si-?sESHr:-::cLL:<3:^L-~:-::sOT 
::<v?.:-::iz -vqskgesvshffl^lcci^ayvdi^^ 

?^^/?AFI^?HVAL~^ 
. ASKLL7;U17GIEVPRQFL?^^ 
?^~7IIF?XFQHFilAAFSG3?Ql^ 

^"--'^VIJDDRVGT-I^FFCZ^ 

QFTIILFLCGLLSXAKQKI^-^VPAAALFJIKRVALWAHLFSS^ 

TFIVJMLRCIYSTQSQKVGQLA^GICANYLKLTYCNACSADCSA^ 

^YGV7lELQ?CFSRLTVIJlL5VNQITrGGVXVLSZ2^^ 

DECXGL~-n^GKNKI73EGGXYLAIJVVKNSK3I^ 

SIASNGISTZGGKSIAilALCSNTSLSIl^TC^IiroEVAZSIAEMLKW 

:<GTA3LACALQSMTGITS:C1NQILIS?BSA2CVYSDEXRIIC? (SEQ m N0 . a) 
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—^^i^E^PE 0 ^ 

-™iE~^I G EJ;-;^i^i_^ 

w , T-guuCuiCGuw. v_ jj, — AGACGCAGGGG7CAAA77Z7^_TTCCAcrT-*--'*r — 

3CCGCATGTTCAGCTGCTTCAAGGAAAGTGACAGGCTGTGTCTG^GGACC^CTC^~ 

-r^E™E;^EX::" mCAC ™ CGATGGCC7GGA ^^ 

~^c^^~r~~__J:^^ 

^^i^ A 5zI ljCT ' , -^' G x L » VJW AAGC . SCTCaAGGGGGCTAGCAAGCTGCTCACAGCCCGCAC 
'^^^i^ ^^^^"^^^^^^^-^^^^"AAGAAGGTGCTTCTCCGGGGCTTCTCCC 

-^Ef^EEEFEEiEEEI-H 3 ^ 23 ^^^ 

US^i^- w * v " w - J - AGG ^^^ cc -=^CCTC^GCAGCCTGTGCTCTGTGCCGC•-C-- , 
~S? C 5^5 CACGATGACCGTGACAGATG ~^CTCC^ 

;:;^ AA ^ GGA JE:£SEE CA ^E AG £E TGGTC 

^ C ^EExE^^ G r jACACTGTGTGCTCGCTGGGG ^ GGTGGGG CACCGGGGCATGG 

^E^^EIEi?^ 




Zr?"ZX:r2?" w AW * Jl1 0XJ> - r CTCTTGAAG777ATACAACAACCAGATCAC 

E~I G x ^^ G ^ G ^ GGG ^^AAAACAAAATAACAAGTGAAGGAGGGAAGTATCTCG 

x wAAGGAGCAA ^ GCC ^ C2 ^ G AGGCTCTGCGGAACCACCCCAGCTTGACCAC 
rEi?^ GTCTTGCGTCCAACGGCATCTC ^ CAG ^ GG AGGAAAGAGC 

.vjuAGCAGAACACGTCTCTAGAAATACTGTGGCTGACCCAAAATGAACTCAACGATGAA 
i -GCAGAGAGTTTGGCAGAAATGTT2AAAGTCAACC\GACGTTAAAGCATTTATGGC'~ 

, A7 E CAGAATCAGATCACAGTcrTTTGTGT ^ GTGT ~^ 

J AC i .-.TCAGGAGTCGACTGCCTGCATGATGCAAGCC\GCTTCCTGTGCAGAAGGTC'-GG 
.uGGCAAACTCCCTAAGTACCZGCTACAATTCTGCAGAAAAAGAATGTGTCTTGCGAGC 
TG7TGTAGTTACAGTAAATACACTGTGAAGAGACTT7ATTGCCTATTATAATTATTTTT 
^TCTGAAGCTAGAGGAATAAAGCTGTGAGCAAACAGAGGAGGCCAGCCTCACCTCATTC 
-AACACC7GCCA7AGGGACCAACGGGAGCGAG77GG7CACCGC7C7777CA77GAAGAG 
TTGAGGATGTGGCACAAA GTTG GTGCCAAGCTTCTTGAATAAAACGTGTTTGATGGATT 
AGTATTATACCTGAAATATTTTCTTCCT7CTCAGCACTTTCCCATGTATTGATACTGGT 
CCCACTTCACAGCTGGAGACACCGGAGTATGTGCAGTGTGGGATTTGACTCCTCCAAGG 
TTTTG^GGAAAGTTAATGTCAAGGAAAGGATGCACCACGGGCTTTTAATTTTAATCCTG 
GAGTu . GAC7G7C7GC7GGCAAAGA7AGAGAA7GCCC7CAGC7C77AGC7GG7C7AAGA 
ATGA £ GATGCC ^ CAAAATG ~ GG ~"^^ 

-^wiAGAAGGCT3AG7ACCATGGGCTACAGTGTC7GGCCTTGGGAAGAAG7GATTCT3 
-CCCTCCAAAGAAATAGGGCA7G3Cr7GCCCCTGTGGCCCTGGCA7CCAAATGGC73C7 
, * iG7CTCCC7TACC7CG7GAAGAGGGGAAG7CTC7TCCTGCC7CCCAAGCAGCTGAAG 
-• G J G ^ C ~AAACGGGCGCC^GAC7ZAGGGGA7CGGC7GGGAAC7GGGCC^GCAGAGC^7 
Z _ aGGACACCCGCZACCA7GG7GGGC77G7GG7GGC73C7CCA7GAGGG7GGGGG7GA7 
•^C7AC7AGA7CAC77G7CG7777GCCAGC7CA777377AA7AAAA7AC7GAAAACACAA 

* 5. A ^ ^ * T>, W**^^»%**»«»^»mm»««»_ _ _ _________________ 
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.\VL1RG F S ? SHLRAYARRMFPSIl^QCRLLSQLEANPNLCSLCS^LrCWI 
- r RCrOHFUAAFEGS rQLPDCTT-ITLTrVFLLVTEVHLNRMQPSSLVORNTR 
£ ^Yf^^GRDTLCS ^^ v ^RGr!SKSLrVrTQEEVQASGLQERDMQL3rL 
.-ALrZLGPGGSQQSYSrrHLSLLrCXIGIPy (SEQ ID HO: 26) 
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^^^r-r"^ 1 U ^~^^^^^^^~^^^^^^^^^'^»C7C7GAGCGGC7 
GCGAAGTC i ^^^CC"^5G7GGCCAAG7GA77G7AAGTCAGGAGACTTrCC77CGG7" T TC 
TGCCTTTGATGGCAAGAGGTGGAGATTGTGGCGGCGATTACAGAAAACATC7GGGAAGAC 
AAG i a ^^^^^^^ATGGGAATCGCAGGCTTGGAAGAGACAGAAGCAATTCCAGAAATAA 
ATTGGAAATTGAAGATTTAAACAATGTTGTTTTAAAATATTCTAA^ 

CCAGAAACTTAAAAAGGGGCTGCGCAGAGTAGCAGGGGCCCTGGAGGGCGCGGCC T *GAAT 

CCTGATTGCCCTTCTGCTGAGAGGACACACGCAGCTGAAGATGAATTTGGGAAAAGTAGC 

CGCTTGCTACTTTAACTATGGAAGAGCAGGGCCACAGTGAGATGGAAATAATCCCATCAG 

AGTCTCACCCCCACATTCAATTACTGAAAAGCAATCGGGAACTTCTGGTCAC7CACATCC 

GCAATAC7CAGTGTCTGG7GGACAACTT3CTGAAGAATGACTACTTCTCGGCCGAAGATG 

CGGAGA7TGTGTG7GCCrGCCCCACCCAGCC7GACAAGGTCCGCAAAATTCTGGAC-GG 

7ACAGAGCAAGGGCGAGGAGG7G7CCGAG77C77CC7C7AC77GCTCCAGCAAC7-GCAG 

ATGCC7ACG7GGACC7CAGGCC77GGC7GC7GGAGA7CGGC77C7CCCC77CCC7GC7CA 

C7CAGAGCAAAG7CG7GG7CAACAC7GACCCAG7GAGCAGG7ATACCCAGCAGC7GCGAC 

ACCA7C7GGGCCG7GAC7GCAAG77CG7GC7GTGC7A7GCCCAGAAGGAGGAGC7GC7GC 

7GGAGGAGA7C7ACA7GGACACCA7CA7GGAGC7GG77GGC77CAGCAA7GAGAGCC7GG 

GCAGCC7GAACAGCC7GGCC7GCC7CC7GGACCACACCACCGGCA7CC7CAA7GAGCAGG 

C7GC77CAAGGAAAG7GACAGGC7G7G7C7GCAGGACC7GC7C77CAAGCAC7AC^GC7A 

CCCAGAGCGGGACCCCGAGGAGG7G777GCC77CC7GC7GCGC77CCCCCACG7GGCCC7 

C77CACC77CGA7GGCC7GGACGAGC7GGAC7CGGAC77GGACC7GAGCCGCG7GCC7GA 

CAGC7CC7GCCCC7GGGAGCC7GCC^CCCCC7GG7C77GC7GGCCAACC7GC7CAG7GG 

GAAGC7GC7CAAGGGGGC7AGCAAGC7GC7CACAGCCCGCACAGGCA7CGAGG7CCCGCG 

CCAG77CC7GCGGAAGAAGG7GC77C7CCGGGGC77C7CCCCCAGCCACC-GCGCGCC7A 

TGCCAGGAGGA7G77CCCGGAGCGGGCCG7GCAGGACCGCC7GC7GAGCCAGC7GGAGGC 

CAACCCCAACCTC7GCAGCC7G7GC7C7G7GCCCC7C77C7GC7GGA7CATC77C:CGG7G 

C7TCCAGCAC77CCG7GC7GCC777GAAGGC7CACCACAGC7GCCCGAC7GCACGA7GAC 

CC7GACAGA7GTC7TCC7GC7GG7CAC7GAGG7CCA7C7GAACAGGA7GCAGCCGAGCAG 

CC7GG7GCAGCGGAACACACGCAGCCCAG7GGAGACCC7CCACGCCGGCCGGGACAC7C7 

G7GC7CGC7GGGGCAGG7GGCCCACCGGGGCA7GGAGAAGAGCC7C777G7C77CACCCA 

GGAGGAGG7GCAGGCC7CCC-GGC7GCAGGAGAGAGACA7GCAGC7GGGC7TCC7GCGGGC 

777GCCGGAGC7GGGCCCGGGGGG7GACCAGCAG7CC7A7GAG77777CCACC7CACCC7 



FIG. 10 (Paae I of 3 



BNSDOOD <WO 010<»?6A2 I > 



WO 01/00826 PCT/US00/17691 



CCAGGCC- '.'li. -ACAGCCTTC7TCC7GG7GC7GGACGACAGGG7GGGCAC7CAGGAGC7 

gctcaggttcttccaggag7gga7gccc2ctgcgggggcagcgaccacgtcctgctatcc 

7ccc77cc7ccc3ttccag7gcc7gcagggcag7gg7ccggcgcgggaagacc7c77caa 

gaacaagga7cac77ccag77caccaacctc7tcctg7gcgggc7g7tgkccaaagcgaa 

acagaaac7cctgcggca7c7gg7gcccgcggcagccc7gaggagaaagcgcaaggccc7 

gtgggcacacctgttttcgagcctgcggggctacctgaagagcctgccccgggttcaggt 

cgaaagcttcaaccaggtgcaggccatgcgcacgttcatctggatgctgcgct 

cgagacacAgagccagaaggtggggcagctggcggcc\ggggcatctgcgccaactagct 

caagctgacctactgcaacgcctggtcggccgactgcagggccctctccttcgtcctgga 

tcacttccccaagcgggtggccctagacctagacaacaacaatctcaacgactaeggcgt 

gcgggagctgcagccctgcttcagccgcctcagtgttctcagactcagcgtaaaccagat 

CACTGACGGTGGGGTAAAGGTGCrAAGCGAAGAGCTGACCAAATACAAAATTGTGACCTA 
TTTGGGTTTATACAACAACCAGATCACCGATGTCGGAGCCAGGTACGTCACCAAAATCCT 
GGA7GAA7GCAAAGGCC7CACGCA7C77AAAC7GGGAAAAAACAAAA7AACAAG7GAAGG 
AGGGAAG7A7CTCGCCC7GGC7G7GAAGAACAGCAAATCAATC7C7GAGG77GGGATG7G 
GGGCAATCAAGTTGGGGATGAAGGAGCAAAAGCCTTCGCAGAGGCTCTGCGGAACCACCC 
CAGCTTGACCACCCTGAGTCTTGCGTCCAACGGCATCTCCACAGAAGGAGGAAAGAGCCT 
TGCGAGGGCCCTGCAGCAGAACACGTCTCTAGAAATACTGTGGCTGACCCAAAATGAACT 
CAACGA7GAAG7GGCAGAGAG77TGGCAGAAA7G7TGAAAG7CAACCAGACG77AAAGCA 
TT7ATGGCTTATCCAGAATCASATCACAGCTWARGGGACTGCCCAGCTGGCAGATGCGTT 
ACAGAGCAACACTGGCATAACAGAGATTTGCCTAAATGGAAACCTGATAAAACCAGAGGA 
GGCCAAAG7C7A7GAAGA7GAGAAGCGGA77A7C7G7T7C7GAGAGGA7GC777CC7G77 
CA7GGGG77T77GCCC7GGAGCG7CAGCAGCAAA7GCCAC7Y7GGGCAG7C7777G7G7C 
AGTGTCTTAAAGGGGCCTGCGCAGGCGGGACTATCAGGAGTCCACTGGCTCCATGATGCA 
AGCCAGC77CCTG7GCAGAAGG7C7GG7CGGCAAAC7CCC7AAG7ACCCGC7ACAA77C7 
GCAGAAAAAGAATGTGTCTTGCGAGCTGTTGTAGTTACAGTAAATACACTGTGAAGAGAC 
7TTATTGCCTATTATAATTATT7TTATCTGAAGCTAGAGGAATAAAGCTGTGAGCAAACA 
GAGGAGGCCAGCCTCACCTCATTCCAACACCTGCCATAGGGACCAACGGGAGCGAGTTGG 
TCACCGL I'Ll'TT'rCATTGAAGAGTTGAGGATGTGGCACAAAGTTGGTGCCAAGCTTCTTG 
AA7AAAACG7G7T7GA7GGA77AG7A77A7ACC7GAAA7A7777C7TCC77C7CAGCAC7 
TTCCCATG7ATTGATACTGGTCCCACTTGACAGCTGGAGACACCGGAG7ATGTGCAGTG7 
GGGA777GAC7CC7CCAAGG7777G7GGAAAG77AA7G7CAAGGAAAGGA7GCACCACGG 
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GCTTTTAATTTTAATCCT3GAGTCTCACTGTCT3CTGGCAAAGATAGAGAATGCCCTCAG 
CTCTTAGCTGGTCTAAGAATGACGATGCCTTCAAAATGCTGCTTCCACTCAGGGCTTCTC 
CTCTGCTAGGCTACCCTCCTCTAGAAGGCTGAGTACCATGGGCTACAGTGTCTGGCCTTG 
3GAAGAAGTGATTCTG7CCCTC'2AAAGAAATAGGGCATGGCTrGCCCCTGTGGCCCTGGC 
ATCCAAATGGCTGCTrTTGTCTCCCTTACCTCGTGAAGAGGGGAAGTCTCTTCCTGCCTC 
CCAAGCAGCTGAAGGG7GACTAAACGGGCGCCAAGACTCAGGGGATCGGCTGGGAACTGG 
GCCAGCAGAGCATGTTGGACACCCCCCACCATGGTGGGCTTGTGGTGGCTGCTCCATGAG 
GGTGGGGGTGATACTAC7AGATCACTTGTCCTCTTGCCAGCTCATTTGTTAATAAAATAC 
TGAAAAC C CAAAAAAAAAAAAAAAAAAAAAAAAAAAGGGCGG (SEQ ID NO: 38) 
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MEEQGH5 EME 1 1 PS ESKPHI GLLKSNRSLLVTHIRNTQCLVDNIiLiQJDYFSAEDAE IVCA 

CPTQPDKVRKII^LVQSKGEEVSEFFLYI^Q^ 

VinpPVSRYTQQLRHHLGRDSKFVLCYAQKEE^ 

ACl^irrTGII^QAASR'iqn'GCVCRTCSSST^ 

WTSCTRTWT (SEQ ID NO: 39 ) 
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as 
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^CGCGTCCGCGCTACTSCGGGAGCAGCGT=CTCCCGOGCCACGGCCCTTCCCGGCCZ3G 
GCGTCCGCGGACCATGGCGCTC7CCGGGCTGTTCTCTAGCTCTCAGCGGCTGCGAAGTCT 
G7AAACC7GG7GGCCAAG7GA77G7AAG7CAGGAGAC7T7CC77CGG777C7GCC777GA 
7GGCAAGAGG7GGAGA77G7GGCGGCGA77ACAGAAAACA7C7GGGAAGACAAG77GC7G 
- j. A" i i ATGGGAATCGCAGGCTTGGAAGAGACAGAAGCAATTC^GAAATAAATTGGAAAT 
TGAAGATtTAAACAATGTTGTTTTAAAATATTC^ 

TAAAAAGGGGC7GCGCAGAG7AGCAGGGGCCC7GGAGGGCGCGGCC7GAA7CC7GA77GC 

CCTTCTGCTGAGAGGACACACGCAGCTGAAGATGAATTTGGGAAAAGTAGCCGCTTGCTA 

CTTTTAACTATGGAAGAG«GGGCCACAGTGAGATGGAAATAATCCCATCAGAGTCrCACC 

CCCACATTCAATTACTGAAAAGCVATCGGGAACTTCTGGTCACTCACATCCGCA^ 

AGTGTCTGGTGGACAACTTGCTGAAGAATGACTACTTCTCGGCCGAAGATGCGGAGATTG 

TGTGTGCCTGCCCCACCZAGCCTGACAAGGTZCGCAAAATTCTGGACCTGGTACAGAGCA 

AGGGCGAGGAGG7G7CCGAG77C77CC7C7AC77GC7CCAGCAAC7CGCAGA7GCC7ACG 

TGGACC7CAGGCC77GGC7GC7GGAGA7CGGC77C7CCCC77CCC7GC7CAC7CAGAGCA 

AAG7CGTGG7CAACACTGACCC\GGTAGGAG7CAGCCCCAGCAAGACCGCAGGCACC^ 

GCAAGCAGGGCCCTGGGGGG777GG7AA7GGC7GGGCCAGCCC7GAG7GCCACC7CAGGA 

AGCAGGCCCAGG7GC7A7777GA7777AGAAAGGAACAGC7GAA7CC7G7C7CCCAAG7G 

CAGCCCAGG7GGC7GCGA77GAAC7GCCCACACC7CGA7GG7C7GG777A7AGAGGGGCC 

7TTGGAAG7A7GGGAA7GGCC7G7G77C7GACCCCTTGCT7TC77CC7A7TC7GACA7AT 

G7AGACA7777AA7GG77GCACAAA77CAAGG77G7A777777777C777AAAAAAA7C7 

T7AGC7GGACA7GG7AGCACACACC7G7AG77GCAGC7AC7CAGGAGGC7GAGGCAAGAG 

3AC7GC77GAGCCCZAGAG7C7AAGGC7GCAGCGAGCTATGA77G7GCCGC7ACAC7CCA 

CAGCC7GGG 1 1 1 I AGAG7GAGACCG7G7C7C7AAAAAAAAAAAAAAAAAAAAAAAAAAAA 

AAAAAAAAAAAAANGGGCGG (SEQ ID NO:4q) 
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VNTD PGRS Q PQQDRRHQCKQGFGGFGNGWAS PE CHLRKQAQVLF 
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r^CGCGTCCGCGGACCGCGAGCGGTAGCGCCCTCCCrccCAGCTGTTGTCCCGCC-GAT 

CCGCGACCCTAGTCCCCGGATCZGG7TGCTGAGAG7CACCGTACTCCAGGGCCAACTGAG 

CCAAAGTCCrGCCAACTTGGGTCAGCAATGAAAGGCAGGATCC7GGGTGGTGGCCC , *GAA 

rCCTGATTTGTCTGCCCTGCCAGCGAGACACATGTGGTCAAAGATGAATTrGAGAAAAGT 

AGCTGCTGGCTACTTGAACAATGGAGGAACACGGCCATCATGAGATGGAAGGCACC'*CAT 

rGGGTTGTCACTCCCACATTAAACTGCTGAAGATCAAGlGGGAACATCTGGTCACCAAai 

TTCGGAACACTCAGTGTCTGGTGGACAACTTGCTGGAGAATGGCTACTTCTO^GCCGAAG 

ATGCAGAGATTGTGTGTGCCTGTCCCACCAAGCCTGACAAGGTCCGAAAGATCCTTGACC 

TGG7GCAGAGCAAAGGCGAGGAGG7GTCTGAGTTCTTCCTCTACGTGCTGCAGCAGCTGG 

AGGA7GC77ACG7GGACC7CAGGC7G7GGC7C7CACSAAA77GGC77C7CCCC77CCCAGC 

TCA77CGGACCAAAAC7A7CG7.CAA7AC7GACCCAG7AAGCAGG7A7ACCCAACAGC7GC 

-UA.CAAC7GGGCCGCGAC7CCAAG77CA7GC7G7GC7ACGCCCAGAAGGAGGACC-GC 

7GC7GGAGGAGACC7A7A7GGACACAC7CA7GGGGC7GG7AGGC77CAACAA7GAAAACC 

7GGGCAGCC7AGGAGGCC7GGAT7GCC7GC7GGACCACAG7ACGGGCG7CC7CAACGAGC 

A7GGCGAGAC7G7C77CG7G77CGGGGACGCGGGAG7GGGCAAG7CCA7GC7GC7GCAGA 

GG77GCAC^GCC7C7GGGCG7CAC^CAa;77GACC7CCACAGCCAAATTC77CT7C^ 

TCCGC7GCCGCA7G77CAGC7GC77CAAGGAGAGCGACA7GC7GAG7C7GCAGGACC7GC 

TCT7CAAGCA777C7GC7ACCCGGAGCAGGACCCCGAGGAGG7GT7C7CC77CT7GC7GC 

GCT77CCCCACACAGCGC7C77CAC7777GACGGCC7GGA7GAGC7GCAC7CAGAC7-CG 

ACC7GAGCCGCG7GCCGGA7AGC7GC7GCCCC7GGGAGCCGGC7CACCC7C7GG7CC-GC 

TGGC7AACC7CC7AAG7GGGAGGC7GC7CAAGGG7GCCGGCAAA77GC7CAC7GC7CGCA 

-AGGCG7GGAGG7CCCCGGCCAGC7— GCGCAAAAAGG7GC7GC7CCGGGGC77C7CCC 

^AG7CACC7GCGCGCC7A7GCCCGCCGGA7G77CCCCGAGCGCACAGCGCAGGAGCA7C 

7GC7GCAGCAGC7GGA7GCCAACCCCAACC7C7GCAGCC7G7GCGGGG7GCCGC7C77C7 

G77GGA7CATC77CCG77G777CCAGCAC7TCCAGACGG7C77CGAGGGC7CC7C7~CAC 

AG77GCCGGAC7G7GC7G7GACCC7GACCGA7G7C777C7GC7GG7CAC7GAGG7GCA7C 

7GAACAGGCCGCAGCCCAGCAGCC7GG7GCAGCGCAACACGCGCAGCCCGGCGGAAACCC 

7ACG7GCAGGC7GGCGCACGC7GCA7GCGC7GGGAGAGG7GGC7CACCGAGGCACCGACA 

AGAGCC7C7TTG7G777GGCCAGGAGGAGG7GCAGGCG7CGAAGC7GCAGGAAGGAGA7C 

rGCAGu i GGGC77CC7GCGGGC777GCCCGA7G7GGGCCC7GAGCAGGGCCAG7C77ACG 

.=A77777CCACC77ACGC7CCAGGCC77C77CACC3CC77C77CC7GG7AGCAGA7GACA 

AAG7GAGCACCCGGGAG77GC7GAGG77C777CGAGAA7GGACG7C7ZC7GGAGAGGCAA 
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£AAGC7CG7CCT3CCA77C7TCC77G7TC7CCTTCCAG7GCC7GGGCGGCAGAAGCCGG7 
TGGGCCCTGATCCTTTCAGGAACAAAGATCACTTCCAGTTCACCAACCTCTTCGTGTGCG 
GGCTACTGGC CAAAGCCC3ACAGAAAC7C C7TCGGCAGC7GG7G CCCAAGGCTATCCTGA 

3GAGGAAGCGCAAGGCCCTGTGGGCTCACCTGTTTGCTAGCCTGCGCTCGTACTTGAAGA 
GCCTACCTCGGGTCGAG7CTGGAGGCTTTAACCAGGTGCATGCCATGCCCACATTCCTG7 
GGATGCTGCGCTGCATCTATGAGACGCAGAGCCAGAAGGTGGGGCGCCTCGCCGCCAGGG 
3CA7CAG7GCGGAC7ACC7CAAGC7GGCC7TTTGCAACGC77GCTC7GCGGAC7GCAGCG 
CCCTGTCCTTCGTCCTGCATCACTTCCACAGGCAGCTGGCCCTAGACCTGGACAACAACA 
ACC7CAA7GAC7A7GGCG7GCAGGAGC7GCAGCC77GC777AGCCG7C7CACGG77A7CA 
GAC7CAGCG7CAACCAGA7CACCGACACGGGGG7GAAGG7GC7A7G7GAGGAAC7GACCA 
AG7A7AAGA7CG7GACG77CC7GGG77TA7ACAACAACCAGA7AAC7GA7A7CGGAGCCA 
GGTATGTGGCCCAAATCCTGGATGAATGCAGAGGCCTCAAGCACCTTAAAC7AGGGAAAA 
ACAGAA7AACAAG7GAGGGCGGGAAG7G7G7GGC777GGC7G7GAAGAACAGCACC7CCA 
TCGTTGA7GTTGGGATGTGGGGTAATCAGATTGGAGACGAAGGGGCAAAGGCCTTCGCAG 
AGGCA77GAAGGACCACCCCAGCC7GACC\C7C7CAG7C77GCA77CAA7GGCA7C7C7C 
CGGAGGGAGGGAAGAGCC77GCGCAGGCCCTGAAGCAGAACACCACAC7GACAG7AA7C7 
GGC7GACCAAAAA7GAAC77AA7GA7GAG7C7GCAGAG7GC77CGC7GAGA7GC7GAGAG 
7GAACCAGACGC7ACGGCA777A7GGC7GA7CCAGAA7CGCA7CACAGCCAAGGGGACAG 
CGCAGC7GGCGAGGGCAC7GCAGAAGAACACAGCCA7AACAGAGA777G7C7CAA7GGAA 
AC77GA77AAGCCCGAGGAGGCCAAAG7C77CGAGAA7GAGAAGAGAA7CA7C7GC7TC7 
GACGGACGC7CC7GGGCAGGA7C777G7CC7AGG77GC7CC7CAG7CACAGACAGCAC7G 
TGCAG7CAGCAGGG7AGCAGGA7GC7G7GCAGCGCC7GCAGCAAGG7GCC737CAGGAGC 
3CACACCTCCACAG7GCACACC'3AT3TCCCCTGCTCATGC773SACT3GTAGCACCCGCG 
CCGCGGC7GAGACCG7GCAGACGCAGGGAG7CT7AGGAACCA7CG7CACCAC7CAAAGCC 
AGCAGGGCA7C7TC7G7ACAAAGA7C7CCC7GCA7A7CCAC7AGACGGAAGC7GAAGGAA 
CGCAACAGCAGAGGAGGCCAACAGACGCC7GGC7GAAGGC7GCG7GGGACCAACGG7G7C 
ACC77CAGAAAAGAGC7GGGAAC77GAGCAGAGCCGA7GG7AAC77C77GGGGAAAGAAG 
GCACCCAG7GAC7GCA7GG77A77C7GAG7CC7CC77CC7C7GC77AG7CCC7'C7CAC7G 
7ACAGG7C7G77TC77CC7CGCAGC7G7GGC7GC7GAAG7AGG7CCAC7G7GGGGAGAGC 
7CA7CACAGAC777GG77CGG77C7GGA77C7CAG7GG7GGCAACCGAGAG7CAGACGA7 
ACCC7C7AGG7CAG7CTCAGAGGA7C7C7A7GC7G7GAGAGGG77GAGGGCCCACCCAGA 
ATT777TT77T77ACCAG77T77AC7G7GCC7GCCCCAGGAGGGAGAATTACT7CCCAGC 
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TOEHGKKSMEGTPLGCESKIXIJJCX^ 

Z?TKPDKVRKII^LVQSKGE^/SE??L^/LQQLEDAYVDLRLWLSEIGFS?SQLIRTiCTI 
"7NTD ? VS RYTQ Q LRHQ LGRD S KFMLCYAQ KEDLLLEETYMDTLMGLVGFNNENLG S LGGL 

3CLI^HSTGVLNEHGE7^/FVFGDAGVGKSM1^QI^QSLWASGRLTSTAKFFFHFRCHMFS 

CFKESDMLSLQDLLFKHFCYPEQDPEZVFSFLLRFPHTALFTFDGLDELHSDFDLSRVPD 
SCCPWEPAHPLVLIANLLSGRLLXGAGXIJ-TARTG^ 

ARRMFPERTAQEHLLQQLDANPNLCSLCGVPLFCWI I FRCFQHFQTVFEGS SS QLPD CAV 
TLTDVFLLVTEVHLNRPQ PS SLVQRNTRS PAETLRAGWRTLHALGEVAHRGTDKS LFVFG 
QEEVQASKLQEGDLQLGFLRALPDVGPSQGQSYEFFHLTLQAFFTAFFLVADDKVSTREL 
LRFFREWTSPGEATSSSCKSSFFSFQCLGGRSRLGPDPFRNKDHFQFTNLFVCGLLAKAR 
QKLLRQLVPKAIlJlRKRKALWAHLFASLRSyLKSLPRVQSGGFNQVHAMPTFLWMLRCIY 
ZTQSQKVGRLAARG I SAD YLKLAFCNACSADCSALS FVLKHFHRQLALDLDNNNLNDYGV 

QELQPCFSRLTVIRLSVNQI7DTGVXVI.GEELTKYKIVTFLGLYNNQITDIGARYVAQIL 
DECRGLIOiLKLGKlTOITSEGGKCVALAVTCNSTSIVDVGMWGNQIGDEGAXAFAEALKDHP 

3LTTLSIAFNGISPEC-GKSIAQALKQNTTLTVIWLTKNELNDESAECFAEMLRVNQTLRH 
LWLI QNRITAKGTAQLARALQKNTAITEI CLNGNLI KPEEAKVFENEKRI I CF 
(SEQ ID NO: 43) 
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gatcatcgttcactgcagccttgaactcttgtgctcatgtgatcctcctgccttagcctccccaa 

tagctgggaccacaggtgcgccaccacgcctggctaattttttttatttttgtagagatgggtgt 

cncactatgttgcacaggttggtctcaaactactggccttacttcaagctatctacccatctcag 

cctcccaaagcgctgggattacagtcatgagccaacttgcctggccagataaaggtcttaagcat 

ggttccttcctgccctaggtagagaaaccccacaaccagtgggaggtggggtgagctctttctgt: 

agcttttgctttgctgatgacgtcattgatctcttcaggggctgcgcagagcagcaggggccctg 

gagggcgcggcctgaatcctgattgcccttctgctgagaggacacacgcagctgaagatgaattt 

gggaaaagtagccgcttgctactttaactatggaagagcagggccacagtgagatggaaataatc 

ccatcagagtctcacccccacattcaattactgaaaagcaatcgggaacttctggtcactcacat 

ccgcaatactcagtgtccggtggacaacttgcngaagaatgactacttctcggccgaagatgcgg 

agattgtgtgtgcctgccccacccagcctgacaaggtgccccggggacagggacgggcatggcat 

tgtgtggaccccgggagctagaagaggcctctccctgctgatctgagtgaagagcgtgggagttt 

agtccagcgggcagggctgcattttggggtactiaatagcacacaaatgcctgggttagcaggttg 

cacagtcaggtattttacttctgtgtttgtgtctggagcaaaccctgacatctcagttctcattg 

ctgtgcgtattggttcccagacacttcanttttagatcccctttaaattaggagggaaaaagaac 

ataagcataagagcatccccagcagcgatgttcattcagtgcctctgaaggctggagggctgctt 

gttgctgggtgagactcggaggggaaccgactcagggtcaggaatgatgacatcccacggtgggt 

ccacagtgaagaatcttccccgctccactgtgggacgccttaacagcccttacttccacttacgc 

tttgcgttanctcctgaaaaataaaacggagaccacaaattccttcttggttagaggaatgacac 

aactcatttatgacatgaccccgctgggactcagaagagaccaggacggtttctgggggaagcag 

cagcacactcgcgcgccrtgccccccccccttgacctgtitttcccacatttttaacaagaaaaaa 

agccgtttttaatatacggcccatcgccctcccactgtgtggcccaggcgcccaccccattacgc 

ccaaggggtggttctcacctctccactcticatncctgcacagcagttgtgccaggttaagaggga 

caaggagaaggctgggcaccgtggctcacgcccgtaatcccagcacnttgggaggccgaggcagg 

cagatcacctaaggt.caggagtttgagaccagcctggccaacatggggaaaacccgtctct:aata 

aaaacacaaaaactagtcgggcatggtggtgggtgcctgtaatcccagccacttgggaggctgag 

gaaagagaattccttgaacctgggaggtggaggttgcagtgagccaagattgtgccattgcactc 

cagccctccagcctgggcgacagagcaagactctgtctcaaaaaagaaaaaaaaaaaaaagaggt 

agagaagrccatggccatttgtctgtcctttttatttttaggctcatggaagcctcctggcttct 

cagagctgagtggttttatttcttgcccaggaggtcatttcacagattttcgggctccaacatgt 

tgactgtcacagcagctggggggatggcatagctaccggctgtactaagaactcagagccctgcc 

crgagcctgcctgagggtccttacggnaggaggatgcccctcacgccagcccgtgcccccatgct 

tgtgtcacctccaggtccgcaaaattctggacctggtacagagcaagggcgaggaggtgtccgag 

ttcttcctctactcgctccagcaactcgcagacgcctacgtggacctcaggcctcggctgctgga 

gatcggcttcrccccctccccgctcacccagagcaaagccgcggtcaacactgacccaggtagga 

gtcagccccagcaagaccgcaggcaccagtgcaagcagggccctggggggtttggtaatggctgg 

gccagccctgagtgccacctcaggaagcaggcccaggtgctattttgattttagaaaggaacagc 

tgaatcctgtctcccaagtgcagcccaggcggctgcgactgaactgcccacaccccgatggtctg 

gtttatagaggggcccttggaagtatgggaatggcctgcgttccgaccccttgccctcctcctat 

tcngacatatgtagacatttraatggttgcacaaattcaaggttgtanttttttttctttnaaaa 

aaatcrttagctggacatggtagcacacacctgtagctccagccactcaggaggctgaggcaaga 

ggactgcttgagccccagagtctaaggctgcagcgagctatgattgtgcccctacactccagcct 

gggcgacagagtgagacccngtctcnaaaaaaggaaagaaaaaaatnaaaaagccttgccaggtt 

cgactccaggcaaagtattccgtcaccgttgagtgccagcccttacttccaaactaatggaagac 

cccatcagttaactgattagttcaataagtattttttgctgtatccaccacatgccaagacccta 

cactgtgccggatgrcagggagacagcggcgagcagacacagacagggcccctgcccccagggag 

ccccaagccagctggaagagaccaccagtcagcaatcccaaaaatgtgtcaggacagcggcagnc 

caaggcatgtgagaacatatcartagggccaggatcngctccggggcaggagtctcctctccccg 

ctttt:gaaccc!:ccactT:tgagacagccgctggcaacataccagcaccaaggacct.aagtcccgc 

cttttaaagaatccaatatgrcgctggaaacagaagcacaagacaggcgcgcgcttaggggaaac 

aaggccagccggcagagrgtcagcgctaggccccagcctccacagcccccgcaggcgcctgccag 

ccactgctagcttctgactctgtctgccccttcctgtctccccctgtttcccccccccatgaaaa 

aaaaagaaagtattcccatgaggaaccattctttcgaaagacttctccgttggccccgctagcca 

gccactttactagcttttacagcgtaactcaccccacaagcagccccacacaaaagactacacat 
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tgtatgattctgtttatatgaaacgtccagaaaaggtaaatctatagacaaagcaaatcagtaqt 

tgcctacggcccagggattggccacaaataggctccagaaaactctgggaagatggtagagatat 
tctagacctggactgtggtgaggtttgcacaactttgtaaact^ 

tataacactccccaacactttgggaggccgaggtgggcagaccgcttgaacccaggaactcgaga 

ccagcctgggcaacatggcgagaccccgtctctacaaaaaaacacaaaaattagttgggcttggt 

ggcatatgcctgtgtcccagctacttgggaggctgaggtgggaggattgcttgagcctgggagtt 

cgagactgcatgatcgggtcactgcaccctagcctgagtgacagagcaggaccctatctctaaca 

acaaaaaagcagtgctggtggaggagggccagcgtggccatctggcctggccctcgagtgcgac?a 

ggcttcagtgtttagctgcagttcagtgatgacactgtgcggaggaataagggtggccngtccci 

gacactgatcccagctgaagtttgtcaccttctttctggcaaatctgaggtcaagcagagagatc 

aaagcctggggccctcagggtcaggaatgGtggctctgtgacgctccccaggccctgcatctgag 

gagtggctgcgctggcctcagggcccaggttgtgaat-tttgtttatgcactcgcccctcctcttt 

gagacctccctgtttgatgctgtttctgcctctctcctcaccctgctgctgtgccctgccacccc 

ctccctccagtgagcaggtatacccagcagctgcgacaccatctgggccgtgactccaagttcgt 

gctgtgctatgcccagaaggaggagctgccgctggaggagatctacatggacaccatcatggagc 

tggttggcttcagcaatgagagcctgggcagcctgaacagcctggcccgcctcctggaccacacc 

accggcaccctcaatgagcagggcgagaccatcttcatcctgggtgatgctggggtgggcaagtc 

cacgctgctacagcggctgcagagcctctgggccacgggccggctagacgcaggggtcaaattct 

tcttccactttcgctgccgcatgttcagctgcctcaaggaaagtgacaggctgtgtctgcaggac 

ctgctcttcaagcactactgctacccagagcgggaccccgaggaggtgtttgccttcccgctgcg 

cttcccccacgtggccctcttcaccttcgatggcctggacgagctgcactcggacttggacctga 

gccgcgtgcctgacagctcctgcccctgggagcctgcccaccccctggtcttgctggccaacctg 

ctcagtgggaagctgctcaagggggctagcaagctgctcacagcccgcacaggcatcgaggtccc 

gcgccagttcccgcggaagaaggtgcctctccggggcttctcccccagccacctgcgcgcctatg 

ccaggaggatgttccccgagcgggccctgcaggaccgcctgctgagccagctggaggccaacccc 

aacctctgcagcctgtgctctgcgcccctcttctgctggatcatcttccggtgcttccagcactt 

ccgtgctgcctttgaaggctcaccacagctgcccgactgcacgatgaccctgacagatgtcttcc 

tcctggtcactgaggtccatctgaacaggatgcagcccagcagcctggtgcagcggaacacacgc 

agcccagtggagaccctccacgccggccgggacactctgtgctcgctggggcaggtggcccaccg 

gggcatggagaagagcctctttgtcttcacccaggaggaggtgcaggcctccgggctgcaggaga 

gagacatgcagctgggcttcctgcgggctttgccggagctgggccccgggggtgaccagcagtcc 

tatgagtttttccacctcaccctccaggccttctctacagccttcttcctcgtgctggacgacag 

ggcgggcactcaggagccgctcaggttctcccaggagtggacgccccctgcgggggcagcgacca 

cgccctgccatcctcccctcctcccgttccagtgcctgcagggcagtggtccggcgcgggaagac 

ctcttcaagaacaaggatcacttccagttcaccaacctcttcctgtgcgggccgttgtccaaagc 

caaacagaaactccttcggcacctggcgcccgcggcagccctgaggagaaagcgcaaggccctgt 

gggcacacctgtttcccagcctgcggggccacctgaagagcctgccccgcgttcaggtcgaaagc 

ttcaaccaggtgcaggccatgcccacgttcatctggatgctgcgctgcatctacgagacacagag 

ccagaaggtggggcagctggcggccaggggcatccgcgccaactacctcaagctgacctactgca 

acgcctgctcggccgactgcagcgccccccccctcgtcctgcatcacttccccaagcggctggcc 

ctagacctagacaacaacaatctcaacgaccacggcgtgcgggagctgcagccctgctccagccg 

cctcactgttctcaggtgaggccgccaggcaaggggagcaacaggtgggccgggcgggccaggct 

cggagggcatcgggaatggcatcacggaccaggatcccccaggactcatgaccatggcccctgga 

atgcccagaccttttctttcttagcagggcagaggtcaaggtgcaaagcttcgaggcaggtggac 

ctggatcagccacagctgggtgcccttgaacaaagtgcttaactctcagagcctccacgccctca 

tctggaaaaagaagatgctcataatcctatcaattatggccacagggaccaatgttagttgagaa 

cgggtgaagtgcattacaaatattacctaatggaatgctctttacaaccctgtaacttaggcact 

gttattgtctctattttggcagataaggaagtagaggcacagagaagttaatagcctgctttagg 

tcacacagcccagacatagcagtgccagaatgcataaagaaccttcctttcaagattaatgtaag 

gccccgagatagccctcaaaaagtttctggaatatgggagcttttactactgcagagaaagcaga 

ccctgtgccagctggcactggtgacttcctgtgatcaacgctagcagcccttcacactgctagag 

acctcagttaaaatgctgactcgtggccgttttcctgttccatagcttacgggaaacagagccca 

gtctgttttcctctaccagcatttcctacgtaaaacaaaccttgtaaatcuccacagggggttaa 

acttgccattacttgactcatgcaccrccaaaaagcagtagggatttggaactgacncccagtgc 
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ctgtcacaccagtgtcagagtgtaaataattgcatggggacatggggtgcagggggtcgaaggct 

gccctagcctgggaattggaaaacctggagtctgttctctgtactctcagccagtgactctccct 

ctgtagccccaggcagtctcacactcagtgccaccccctgtccatcttttttttttctcccccaa 

acggagtcccgctctgttgcccaggctggagtgcagtggcgtgatctcagctcactgcaacctcc 

gcctcctgggttcaagcgattctcttgccccagcctcctgagtagctgggattacaggcacacgc 

caccatgtccggctaagttttttgtatttttagtaggacggggtttccccatgttggccaggctg 

gtcttgaaatcctgacctcaggtgatccgcccgcctcggccttccaaaatgttggggttacaggc 

atgagccgccgcacccgacccctctgtccatcttttcaatgggaaactccacaccagtgtggtgg 

ccctgcccttcctgctgtccccaggtgaagctttccctcacaccagtgcaagaaaaaacagcttg 

taggaaagcagaggatatgggtaaccacgggaagcacactcagttctctggctgcatcagttagg 

attagttttagctgagagcgaaaaccccaaatgttggtgagttacaagcttatttccctcatgta 

aaagtctagaggtaggtagttcaggactggtatggagtctccatgaccctccggagcccaggctc 

tcttctgccttcctgttctgccatcctcactacccggctttcccatcttggcccaagagggctgc 

tcaaactccagccatctagtcgacactctagctatcagtaagaaggaagggcaaagattgagagc 

atgcctcaatcttttaagaacacttcttggctattactaattatattgctgcttagatttcagaa 

cttaatggtatgggcagaatttaatgagatgggcccagctaaaagatgggggaatctattgctaa 

gaaagtatagatattgggaatgtctagcagcctgtgctgtcntgggctggccatgccatgtacat 

acacactatttcccagcaccaagctggggactctgagggaaagggtccagagtgtctgacttgat 

cattttgatgtggcctaaaaatcaagcttttaattgttcagccttttacttgttatcaaggtcag 

cttgtgggtctaactgggcccaaggcctrgtgtttctaagtaaagctttattggaacgcagccata 

cccatttatttacttactggcngcctcacactacacagttgagtagctgtgacagagaccacacg 

gcccacagagcctaaaatatttgctgtctgacaccttacagaatgacatgagcagtctcctttga 

cagtgggactcacagccttttccagtgacaaatcagggttagcccatgtgtttctggatgggggg 

aagctgttggcattttgggtataacagttcttgtgagacctgnccagcattttgcaggacaccta 

acatcattggccctgcctgcaagatgacagggcactccctcctccagtcacaaccactaaaagca 

gcccctgacatttccaaacccatgccctccaccacacgagaaccagg.tacagggtctggctgaca 

cataggtcacacgcaaagggtggatgtcagaggtggctggcctcacacgtcctccctgtgtcctt 

cacggtcgtgtgaggagccaggggctgtgctgcagcctcgctcatgggctggtgcaggatgggtc 

cggcggccccacgttggccaggccttgtaaggggctatttggctgattgctgtggccattctcca 

ggggcgtctatacctgagaaaactccagggcctgaaggcttctggatctttgtaagattaatggt 

ccttcataatgagtgcctgccccgactcgtaatttttttgctgttttatttcagactcagcgcaa 

accagaccactgacggtggggtaaaggtgctaagcgaagagccgaccaaatacaaaattgcgacc 

cacttggggtatgtctttctccagaacactgggccaactacctagtaataatacagagccgcagg 

gaattcacattcccataggtccctggatgatcggcacggatggcccagggctgggaagagcgctg 

gcccaggagttgagagccccgggttccctrtgtggctcggccagtcatgaagtctcgctgagcct 

cagcctcctcacctgtaaaactgggancccagtataggcaagtaggcttacaactggttattggg 

ggatgcaacgagaatataaggggacatatttaataaatgctagaatcctgtttacatatcagtct: 

ggactaccttgggtccataatcccncatccagagcctttggggcaagacccgaacggggactctg 

agtgcacgctatggcatgacgtiggccgcaggggtctaaggcagtgccccattttcaaacactttc 

acatttctcccgcagaatgtatgaaacagtcaaaccaagtgtggtaagaaagactataagtagct 

ccacatcagttgccaaaagaattgtgagaaactttgggcattcagagcctttgaggttttggagt 

ctgagagaagggattgcgggccagccccacacaactggcggctctgcaagctggagcagctgctc 

agtnncctggggcctcagtggccttcgatgttaatgaggacatggacgcaaacgaccccgggcca 

cactcggctccagggctctgtgtggctgtggaaccctggaagcctgagcctagctgcctttcaac 

tcccatccgctgtactattgaattggcattgagcggcgagatggctgaaaggtagacatcgagaa 

gctttaacatncagaatcttttcttcccaagacgctgaatgtaaccttagttgcaaatacccatc 

acctgccagtcaccgagcactcacgcaccagggctttgcgttatgtcctaagaccctcataacca 

ccctgcaaggggactatcatcattacctctgtattacagatggagaaactgaggcacagagaggt 

aacgtgacttgtctcaggccataaagctggggaaagtagtggagctggttttgaacctgagctgt 

gagaccccagagccccaaactctggcgccngtgtgctcccccctcaacccagacnccggaaacca 

gcagacaccatatgcttcaaaaaacaggggctaccaaaatgacatcaggagccagaaagtctcat; 

ggccgtgctctctcttgaagcttacacaacaaccagatcaccgatgtcggagccaggcacgtcac 

caaaatccrggatgaatgcaaaggcctcacgcaccttaagtaagtggggtaggcaccaggtccct 

cagtatattctcttgaccaccccctcctgttgcccaaagattaaatgtcacagtaaagagctttc 
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gactgggaaaaaacaaaataacaagtgaaggagggalgSltclcgclctgg^ 

^ 999999Cg9 9 ccct 9 cc 9 ttcttttcacac accagtacaccagaa^ 

I"? 9 ^ 93939 " 39 * 99 ^ 9 ^ 
aa """ 99CCCCCCtcccccct9CCCctt 9«^^ 

cgactactctaccccatgtaagtggaatcatacagcatttgccttttggggatggctgatEEcac 

"""" Cttt " 9aca9a 9 tccc 9 ctct 5 t cg=ccaggctggagtgcagtggcatgatctcgg 
""" 9C aacctcC9ccttct 399 tt '= aa 9«=3attctctcgcctcagccacacgag?agctSl 
actataggcacccgccaccaatcccagctaacttttgtatctttagtagaggcggggtctcalll 

rS^frr a ? 9Ct99tCCCaaaCCC " 9aCCtCaaat 9 atCCa «= a CCt^ 

^ a99a "" a ?9 C9C 9 a 9 ccacc 9Cgccccgccaggatttccttcttttttaaggccgagt£aI 

a "" a " 9 = ac 99 ccat 9ccacatttcgtttactcattcatccaagaacagaclltglcltgct 

^^^""^ctgttgtgaataatgctgctgtgcacaCgggcatacaaalgtctcElcaaiS 
"?""= aa " ctctctcctcttt """tt"^ 

a9 "" a 999 taca cgtgcacattgcgcaggttagttacatatgtatacatgtgccacgctggtg 
cgctgcacccactaacgtgtcacccagcattaggtatatctcccaatgctatccctcccccllcl 

aa ^I^I 9 ^ ttCCaaCttCaCCCaC9CCCCCacaaa 99atatgaactcatcattt?ttatggIti 
^" a9 ^ a " CCat9 9 c 9"cacgcgccacactttcttaacccagtctatcattgttggaclEtti 
ggttggtcccaagtctttgctaccgtgaatagcgccacaataaacatacgtgtgcatgtgtcttt 
a r a « a lr a . 9aCtCaCaCtCatCt999CaCaCaccca9CaaC9 99 a tggccg^ 




:ccgacggccagtgatgatgagcatctcttcatgtgctttttggctgcataaatgtcctccttt 
? a I aa ?^^ 9Ct ^ at9tCCCtC9CCCactC,:tt 9 at 9gggccgcttgtttttttcccgtaaat 
ttgctcgagctcattgtagattctggatattagccctttgtcagacgagtaggttgcgaaaattc 
i"" Ca ^ 9CC 9 ta 9gttgcctgttcaccctgacggtagtttcttttgctgtgcagaagcccttt 
agtttaaccagatcccatttgtcaatttcgcctcctgtcgccactgctttcggtgttttggacat 
gaagtccctgcccacgcccatgtcccgaatggtaacgcctaggctttcctctagggttttcatgg 
ttttaggcccaacgcttaaatctttaacccaccttgaattgatttttgcataaggtgtaaggaag 
ggacccagcttcagctttctacacatggccagccagttttcccagcaccatttactaaataggga 
atcctttccccattgcttgtttttctcaggtttgtcaaagatcagatagctgcagacacgcggca 
ttacttctgagggccccgttccgctccatcgacctacacctctgtcttggtaccagcacSacgct 
gttttggttaccgtagcctcgcagcacagcttgaagccaggtagtgtgatgcctccagccctgtt 
ctttcggcttaggactgacccggcgacgcgggctctittttcggctccatatgaactctaaagcaq 
cctcccccaactctgcgaagaaagccatcggcagctcgacggggatggcatcgaatctgcaaatt 
acctcgggcagcatggccattttcacgacactgatcctccccacccatgagcatggaacgttctt 
ccattcgtttgtgccctccctcactiticcccgagcagtggcttgtagtcctccccgaaqaggccct 
tcacaccccttgtaagttggactcctaggcaccttattctccttgaagcaactgcgaacgggagt 
ccacccacgatttggccccccgtctgtctgccgccggcgtacaagaacgcccgcgaccttcgcac 
accgaccccgcatcccgagacctcgctgaagccgcccaccagcctaaggagaccccgggccgaga 
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cgatggggttttctagataaacaatcatgtcgtctgcaaacagggacaatttgacttcctctttt 
cctaactgaataccctttatttccttctcctgcctgattgccctggccagaacttccaacactat 
gttgaacaggagcggtgagagagggcatccctgtcttgtgccagttttcaaagggaatgcttcca 
gtttttgcccattcagtacgatattggctgtgggtttgtcatagatagctcttattattttgaaa 
cacgtcccaccaatacctaatttattgagagcttttagcacgaagggctgttgaatcttgtcaaa 
ggctttttctgcatctattgagataatcatgtggtttttgtctttggctctgtttatatgctgga 
ttacatttattgatttgcgtatattgaaccagccttgcatcccagggatgaagcccacttgatca 
cggtggacaagctttttgatgtgctgctggattcggtttgccagtatcttattgaggatttttgc 
atcaatgttcatcaaggatattggtctaaaattctcttttttggttg.tgtctctgcccggctttg 
gtatcagaatgatgctggcctcataaaatgagttagggaggactccctctttttctattgattgg 
aatagtttcagaaggaatggtaccagttcctccttgtacctctggtagaattcggctgtgaatcc 
atctggccctggactctttttggttggcaaactattgattattgccacaatttcagagcctgtta 
ctggtccattcagagactcaacttcttcctggtttagtcttgggagagtgtatgtgtcgaggaat 
gtatccatttcttctagatcttccagtttatttgcgtagaggtgtttgtagtattctctgatggt 
agtttgtatttctgtgggatcggcggtgatatcccctttatcactttttattgtgtctacttgat 
tcttctctcttcttttctctat'tagtcttgctagcggtctatcaattttgttgatcctttcgaaa 
aaccagcrcctggattcaccgatttcttgaagggttttttgtgtctctacttccttcagttccgc 
tctgattttagttacttcttgccttctgctagcttttgaatgtgcttgctcttgcttttctagtt 
cttttaattgtgatgttagggtgtcaactccggacctttcctgctttctcttgtaggcatttagt 
gccacaaatctccccctacacactgccctgaatgcgtcccagagattctggtatgtggtgtcttt 
gt.tctcgctggttccaaagaacatctctatttct-gccttcattccgttacgtacccagcagccat t 
tcaggagcaggttgttcagcttccacgtagttgagcggccttgagtgagattcctaatcctgagt "? 
tctagtttgattgcaccgcggcctgagagatagtttgttataatttctgttcttttacatttgcc * 
gaggagagctttacttccaactatgtggtcaattttggaacaggtgtggtgcggtgctgaaaaaa 
atgtatattctgttgatttggggtggagagttccgtagatgtctattaggtctgcttggtgcaga 
gccgagtccaattcctgggtatccttgttgactittctgtctcattgatctgtctaacgttgacag > 
tggggtgctaaagtctcccattattaacgtgcgggagcctaagnctccttgtaggtcactgagga ? 
cttgctctacgaacctgggtgctcctgtattgggtgcataaacatttaggatagttagctcctct ■* 
cgttgaattgatccctttaccattacgtaatggccttctttgtctcttttgatctttgttggttt ^ 
aaagtctgtttcatcagagactaggattgcaacccctgcccttttttgttttccactggcttggt: .* 
agatcttcctccacccttttattttgagcctacgtgtgcctctgcacgtgagatgggtttcctga 
atacagcacactgatgggtcccgactctttatccaacttgccagcctgtgcctttcaactgcaga 
attcagtccatttatatttaaagttaatattgttacgcgtgaatttgatcctgccattatgatgt: 
cagctggcgattttgctcattagtcgacgcagtttcctcctagtctcgacggccnttacactttg 
gcatgattttgcagcggccggtaccggtcgtccctttccacgtttaccgctcccttcaggagctc 
ccttagggcaggcctggtggcgacaaaatctctcagcatttgcttgtctacaaagtactttactt: 
ctccttcacttatgaagctcagttcggctggatatgaaattctgggttgaaaattcttttcttta 
agaacgttgaatattggcccccaccctctcctggcctgtagggtttictgccgagagacccgctgt 
tagtctgacgggctttcctccgagggtaacccgaactttctctctggccgcccttaacatttttt 
ccttcatttcaactttggtgaatctgacaattatgtgtcttggagctgcccttcccgaggagtat 
ctttgtggcgttctctgt at t tec tgaatctgaacgttggcctgccttgctagattggggaagtt 
ctcctggacaatatcctgcagagtgctttccaacttggttccattctccacaccactttcaggta 
caccaaccagacgtagattcggtcctttcacacagtcccatatttctcggaggccttgctcattt 
ctttttatccttttttccctaaacttcccttctcgcttcat'ttcattcatttcaccttccattgc 
cgatacccrttctticcagctgatcgcatcggctcctgaggcctctgcattctccacgtagttctc 
gagccntggcttccagctccaccagcccctttaagcacttctctgtattggttattctagctata 
cactcntctaaat'ttttttcaaagctctcaacttctttgcccttggtctgaacgtcctcccgtag 
ctcagagtaatttgaccgtctgaagccct-cCDctctcagctcgncaaaa'ccattctccatccagc 
tttgttctgttgctggtgaggaactgcgttcctttggaggaggagaggcgctctgcgttttagag 
tttccagtctttccgttctgttttttccecatccttgcggttttacctactcttggtctttgacg 
' acggtgatgtacagatgggcrttcagtgtagacgtcctttctggctgttagccttccttccaaca 
gacaggaccctcagccgcaggcctgttggaacaccctgccgtgtgaggtgccagtgtgcccct'gc 
tggggggcgccccccagttaggccgctcgggggccaggggccagggacccactcgaggaggcagc 
crgcccgccctcagatcnccagccgcgcgctgggagaaccactgcccccttcaaagccgccagac 
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ggatcacIIggtaattfftafttttttgSgaatcatr^ 
ttttacgttllcaccaacagttcattctaSSctcc^ 

c C tggaaacfatgaaaK'a"aataa a catcttct^ 

tcagttaagctcccccactttctctcSttccc^ 

gggcgagacnatatgcatctacactatagtaggggtgagalt^^^ 



„;X;?:=r**" 9C39ccct 9 cc 9 tcccca 939gocaacct g gE;cctcccagociccc£aQa 

c=t!gtKiicSS?s;r 9c ^ cta " a9a ^^ 

gcga FF^^ 

!!?"f"?^ ct = c ""9ttcaactcccictt a c=IIto a9 Hc!tac!^L E ?™?^ C ^ 



gpacngctgggtcaaacggtatttccggtcccagatccctgaggaaE^acc^aaqta^tatt 
«?«cagtgaactctgcacgtgtcccacaccagccaaccalca1c"^^ 

-pl^=^ 9aC9Ctggcca9CagacCccc g cccctc =cttgtcctcaggacqacqbtaaa 

"Jjaaggtggggacagtagcagaagagacc^ 

gc «gcct t cagaaggacagcttgccaggcc t cgccag?c C =c?gc^^^ 
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gtgccgaccccagcccactgttgtgngagcaggctgtgctgatgacccatttcccgtccagcctg 

cccttgtgctctgtgtgtgggctctggggcagcagcgcctgggcactactgctgcagctgaacac 

ttctgcatccngccccgagtgagcctgggccggggccacagccaggcagaggcttcccagctgtt 

ctgatgttgaagctaagattgaatgtagatgtgtctttaataattcaccccaagtgtgttcctcc 

ctagtcttgcgtccaacggcatctccacagaaggaggaaagagccttgcgagggccctgcagcag 

aacacgtctccagaaatactgtggtaatagctcgagtcatttcatttgtttgtttgtttttctgt 

gatagggtcttgctctgccgcccaggctt-gagtgcattggtgtgatctcagctcaccgcagcctc 

cacctcccaggctcattcgaacctcccgccttggccttccgagtcctgagactataggcatgcac 

caccacacccagccaattttaaaattttttgtagagatggggttttgctatgttacccaggctgg 

tcttgaactcccgggctcaagcagttctcctgccctggcttctcaaagccctgggattgcaggtg 

tgagccactgcacctggcacagagncactttggagggtttaggtcccaggaattatcccaggggc 

tgcacatggcctggaaccttaacagaaaaggtgtctcccaattggaaaggctctaggcctttcag 

ttaagttgataatttcctcctagagaagagaatagccacttctacaagcataaacaggtacagga 

ggaggaagtgggctccgggagcctggatctgaggccttggccttctaggccccaggagaaccaga 

acgctggccatgcaagctatccaggcatcctitggataccttcagatgtgcttagcagaggccaac 

ttccacacacttggctcaaaattttctcccttcctcctcttcatctgccttcccccaggcagcct 

cctccttccccaggtcttcacatcagggtttggcc.tttatgctccatccagctcatctgtcactt 

gtcacctgaagcccacagtccccgctccctctctgcactctagggcacttactaagtggatgtgg 

cctcccgagagtgttttttgttggcgctcccttttttatggccacttaatgttttafetttgcttt 

atntgtatttacatctctgtatcataaattccatacaggtggctgggagcagtgacrcacacccg 

taatcccagtactttggaaggctgaggtgggaggatcgcttgaggccaagagttcgagactagcc 

tgggcaacatagcgagaccctctatctacaaaaaaaaaaaacattccttacaggttaagtgaggg 

agt.cgtatcacaaccctccctaccatctact:cagagcccagtgctcatttgatcttgctaaatta 

gttactgagaataatgacaatatcctcttcatgagagagttttgacattaggcctgctgtccagt: 

aagtgcattttaaattctttcccctcaacaaaccatttaacattttgaaaagtagtttatgtttt 

ttggaaaaaatgtaagacactaaaggaggacatgaaagtacctcctaaagttcctgctaaaagga 

ggaagtgaaagtacctccctctgcgccttccaaaacaacctttccttcctagccttttgttctat 

gtatgttcaaagatatgcaaaacagaatagcattcaagcagtggctctaaaaatatcgtaatcac 

atacctcacatgtctcctctagggcttccccaccttgatgctgt:tgacattttggt:ccaagcgat 

tctttattatggtagggccgtcctgtgcaccatagacggtttagccgcatctctgccctgtacct 

cccagtggtgaggatcaaaaatgccLccggacatggccaggcgccccatggagagtgaaatcaca 

cggatagcagcaatgt-caacacccagaagccctcaagtgctgaccgcatgccatgtgttattcta 

cactttttccccgtgttaactcactcagcctcacaaccactctatacgatctctactgttaacgt 

tcaccagtgagaaaactcagacccaaagaacctaagcctgttgcccgaggtcaccctgctggcgg 

gcgacacaaacctgcccaggctgagcccggagcagatgrcaatgctgtgttcttctccctcccca 

ttctacctcattctccctacaagctgcacaacatctcgaatagatatcacaatacatttcatcag 

ttgtttctgatctaaatttgttcagattttacattaggataataccacaatgcatgctgcaacgt 

ataaagctttgcgtgtatatccttgcacactgcagggtaaatttctagaagtctgattgtcttaa 

aatgaagcacattaaaaacttgggcaggcacatccaaactgcccttcaaggaatcttcctttcta 

aatgttctttctgttctattcttcttcctaatgattctttcgtccactggcacaagcgggtccta 

ccctgtttacaccaaggagctttggcgctttatccagaccacttctggttctaaggaccatcgag 

agacttcctgaactttcagtcacttaacttgggtccctcacaagttaactgagagcaaagtactg 

aacacattttaatgtgcagtcagtgactgtctcaggccttcaaactaacttggacaacacactgt 

cagtggtgttcaagggaccccgggactagaggagaactgagaagcaggcattggccccttgtttt 

ccgtgggcccccatcttccatgaaatctgagggctcagcaaaggtggggagggagggtgggctcc 

tictacaggt age cgggccaagaaataggagcccaggtacaggatccgcattaaaaat gage ccca 

t tgaccttctgtggggctgacaggctgggcttggagcctggctgttttctgggttctcagcaagt 

gatcatctgcatagctggagagccttgggctgagctcccgctcctgtgaactctaaaacaatgtc 

cgccaagtaggccctcttgagcaaatacttccttttttttccttaggctgacccaaaatgaactc 

aacgatgaagtggcagagagtttggcagaaatgttgaaagtcaaccagacgttaaagcat.ttatg 

gtaacccagagagccttacaattccagaccgtgccacctttcaaaagtatttttcgagataaaat 

ctacatactgtaaaactcaccctcttaaagtatacaattcagaggtttttagtgcaaccatcacc 

acccaattctagaacattttcacccctcccccccactccaa'aaagccctggtatccattaagcag 

tcactccctgtcctcctccccagaccctggcaaccaccaacccgctrtctgtccctatggattcg 
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? a9 ^ gatg99acactt99 9"gtttctactttttgac^ 

S"etattttacattctcgccag2aatgtatgt|ga??f^ 
" a "attgtcca t cttttaaaltctalttata?lag^ 

tttgcatttc C ctgatgacaacaatgt?gaatg t c C ?t??"ItIcc?actg^ 
cttctttggagaaatgtctccatatcctttgcccattttaaaattgggt^ 

t?^ 3 ^ 9 ?? 9 ^"""" 3 "" 9 ^^ 

tttctttctctctgtggagtttttcctctttcttgatagtgacctttaaaaaacaaeaa aa ^ a : 

a caac^^^ tttt9a9atg9a9tCtt9CtCtt9tCa «Sg" 

tSIJS tCt9CaaCCC?Cacctcct ^9 tt «agcgatcctt C tgIItIaicc?cc?S 

ccf? a t^r 99CatCa9CCaCCat9CCt9tetCaCttt 9 citt "taac 

ccatttaggcccaggccggtcctgaactcctgacctcaggtgaCccacctgccEcaallfcncf 3 
agtgctgggattacaggcgaaaagccactgcacctggccLIagtttttaattttaltaaSar^ 

acaggtccgcaacccattttgagctaatttttgtatagtgtaLgtgagggItaacctcat a e a n 
tcgcacgcggatatccagccgtcccggcagcaccacglgltgaalagtfHtc^tfcctattoa 
acggcctcgacacccttgtcaaaaaccaattgaccaEaLtitaCgigtttatttctafattS^ 
t9"«ggtccattgatttata C gectctcctatgccaggaIcat^ 

""tcgaaatcaggaggtgcgagttctactttg^ 

CgggttctttgcatttcttatgaattcagactcaccttgtcaatttctlcaalaagtSaga^ 
a S 

tgaacccaggggccctagcccttgtggggtatggctggiagcIILglfcctggctgcS 
gcctaa1!a?S^?^ 9C ^^ 

c a ??a a « ctt g aaatcccaca =acccagtggacggggggcactcttatagtcattlt 
a 3 ^^ a ^H 9 CaCaaCt9a99CaCa9acagat 9 c 9 tccattCctt =aaggttctgtagctg 

a^cc?^? 9 ! 3 ?? 93999 "^ 93 ^ 

?"" Ctt " ttCaCgaca 9 catag 99 ccctc 9 tcc ccctggt:aagggctLcttcttgigl5L 

of^ a ?^ attCCtaa999CCaCgtCta9ca99a 9 CCCact « a <=aIIIagc«^^ 

?i"" gCgaCgaa 9=99 a 9 a cactacagcctcttgatgcac«^ 

??a a ?ta!a^ aa ?l 9a9CatttCa99a9a99Cet99CttCCCa ^ 

^ a "^ 9aaCcact 9 ctaccca 9 aacaa tggggtgcattctcagagtccccattattaaagcttt 
t " a " gagccccatgagaactattcat 9 a 9 aa c"tttcacggclgcat:aactgtttctlct 



ct 



cccccttgcatgtcggtagcctcttaactttaaaacctgccttgcctttccctagccacctgg 
aa99a9a ^ Ca9aCCtCCC9tCccacg 9 t 9tgtttcttacaatttgttgttcagattggtgg!I 
tcccaaatatatataaaaatataaacggagtctcactctgtcacccaggltggaltgclltggca 

tagccgggagtacaggtgcacaccaccatgcccagccaatttttttgtacctttaatagagacai 
gttggccaggatggcatcgatctcctgagctcgtgatccacccacctcggcctcccaaagtgctg 
ggactacaggcgcgagccactgcacccggccccaaacatcttgattatgcacctctgcagtgaai 
aatgcaaacacacacatcagctcatgtattacattacgttcactataaaaacaaacagaaaattt 
aaaaaatatcaagctatcccttactctagtggatctcacctggacacttttagccagacacaaag 




:atccagaaccagatcacagctaaggggaccgcccagctggcagacgcgccacagagcaacacc 
ggcacaacagagacttggtaagatcccagcgctcgtcacagcaacaacaccagtgaccgtttacc 
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caccaccactgactgtgcaaggcacaacgcagggtggtttctgtttattcctccagcaaccctac 

acagtaatggtattacctctgttttacagaggtagacagaggcccagaccagtgaaataaggtta 

cccaaggtcactacgagagaagctagaattcagcccagaatgcctgattccatattctgtgctct 

cctgccctgggcccccgccctcatctaccttcattgggtgggacgggggaagtggccagtgaaat 

gatttcctagtggaagtaaatccccctgggactcagcaattgagagatgaccgtgttggccagga 

gtttggagctcattcttccccttttctgggttccgcaagacatttccaggctgacttgaactaac 

ctgtgctctttgtctacttcttttttctgctttgagaacttcctcacgctaatagaagaaaaaaa 

gtttgctttactgtgacattgagcgccatgccacctctttcttgcctcccataaggcacagacac 

tccccactcagcagctcccttaacaactcaattgcctgggtgacgtgggactgggtggatgctQQ 

gagaggggccttattaactatgtcctcctttcatgactggggagaattccatagccaattiaaaa 

aaaacaaaaaacagctccttggccaacacaggctccccatacagtgttttttaaactttgcttta 

gaacctgtttggaacttgtcataaaatcgatcagtttggtgaattgcaaccaacaatatttaaaa 

agaaaacagaacagaacaaaatatcaggatgcaatgtgcatggtatgaaagcatcatttcattca 

tcttagttcatgctcgcatgcgagcgggtgtgtgtttgcataagtgttggttcacaacataaaat 

gtaattcttatttagggttgtagacaaaaggtttttttttaaaaaaaacactgttggctaggcat 

ggtggctcatgcctgtaacaccagcactttgggaggccaagatgggcagacctcttgagcacagg 

agtttgagaccagcctgggcaacacgcgaaaccGcgccactacaaaaattagcccgacatggtgc 

catgtgcctgtagtcccagctactcaggaagctgatgtgggaggatggatgcatgggagaccaag 

gctgcagtgagccaggatcacgccactgcaatccagcccgggtgccagagaccctgtctcaaaaa 

acaaaaaagaaaaaaagaaaaacaccatcatagagaatagagcccagatctaaacagacaccngt 

ggcctgtgtgcctgcgaagcccagcctgcccagcagcctgggaagcactggagggcactggaact 

gtttgcatgggtgtttgcccccaggccactccgtctctgccgattcttaagttttgaggacagca 

ggcagagggggagaggaaggagaccgccagactacagaacagttcgcagagcacagttggcttcc 

acttttctccgcagccggtcaggcgggtagtaaagacctacagttgctttaatcctgtcaagttt 

caaaatctgcattgcttccctcttgagggtcaccattcctacacaaggaaccattttagtagggc 

caggagacttcagcttcaaggcctgcacttgcgtcagggtggagaggggaactggccaccaattc 

agagagggcaggacaggcggcatgggtgctggtcttgggagtgtcctcacttaggcccctggctt 

gttctgggagcctccagagcatgctcctccgcgtgtgacttcatgggaccgggctccgagaaggc 

cgtggctttgntggccctgccagggaccgccacaccaggccacagggttgtggttgagctggccg 

gggagccacgttcagggagcagctctgcttggagccaacacttacagagtaagccttctccctgg 

acttgttaactgtactgacacttatttctacctcattcctttctgaaaataacttggaagtctga 

agtcccttgatgagctctgtctttaagaacagaaattagaggtgaacaatgaacactgtaaatta 

cagaaacgtatcccactccagcataacagccttctgcgaggctatcccctccagactgtggctct 

gggagggcggggcctgagtcaaggtcctagggactagtgctgtgtictccatttattccttgaata 

acgaaacgcttgagcatcagggaccgtgctagcaccaaaaatccagtggtgaacaacatggcctc 

acgggttcactgtctagaaagggagaagcacattaaagaaaaaaccatttgcgcaattatttaat 

tacaactgtgatgggtactaccacaaaggggaaggccaagagggaacctgatttagatgaggttg 

cagggaaggcctctctgaggaagcagcacttacactaagccacgaaggatgaataggagctagtc 

agctgaggtgagtattctgcgcagggaacagcatgtgcaaagggtctggggcaggagggagtgtg 

gtgtcctggaagaactgccagaagctgctgtgccccagggttcagacagtgtggaagaggggact 

acaggaggctgaggagataggcagggactggaccataaaagatctgtgggtcacgacgtgcattt 

cggcctccatcctaaaagtgatggaaagtcagtgaacagcttgaagcaggagaggcatgcgatca 

gatctgcaatgcaaaaagaccaactcttggctcttctaggaaactgaattggagaaggccagagc 

acgtggaaatgacctgtcagtaggacattgtactgatgcagggaagagatgatgggtgctcagac 

caagatggccggccaaagacatagaggttccagggaggcattctagactcttaggaattagggga 

gaactttgtgatacaaggaacatggggatgagaaggaaggtgtccaggttgacccccaggctact 

aacctgctcagcaggatgagagcggrccattcactaagccaggggaccctaggaggcgtggctac 

tttgaggtgtgggggagaggcccaagtgaggatgccaagcaggtaactgcctccacggacataca 

aacaaggccgtggcattgatgagaccgggtggggaaaagggcccagccccaaacctggaggaaat 

ctcagatgtagaggtcacatggaggagaatataggaaaggaaattgaagtagagtgcccagatgc 

aggagaaaaatcagcgcatacaaccaagccaaggggagggagcgcctcaagaaggagggagagga 

gaggccaggacagccaaaatcctgagggccaagaaagacaagacctggaaaacgtcattaaattc 

aggcttacggaggctacaggtgaccccagtgagacccagngaacagagggatggcagctggagag 

gacccacgctaanatigaaggaacnatccgcaaagggtacgctccccaaccccagggacacatgtg 
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1 GTCGACCC^CGCGTCCGGCAGCAGGCAGGCT^^ 
^gCTGGGTCC^CAGG^ 

6 3 TGACCGCGGCTGCCCACCCCAG^ 

ACTGGCGCCGACGGGTGGGGTCTCGCTACCCCGCCC^ 

i A^ _ 9 . A A . ? - P- A A . A A - A A 

125 GAAAACTTGTCACXXX^TG^^ 

. CTTTTGAAGAGTCCCCTACTI>3AGTT^ 
13A _E_ _N _ L _ A A _ D_ E_ A _ A A - - K_ M_ A A. A A . A A A . 

187 AGAAGGCTATGGGCGCATCCCACGCGGGGCCCT^^ 
TCTTCCGATACCCGCGTAGGGTX^TC 

33_^_ _ e_ A A . 9. A . L - f _ A .9 , 6- A -A - P. A .9. A .L . 9 _ A A _ . 

• 249 ACAAACTTGTCAGCTACTATCTGGAGTCGTATGGCTT^ 

TGTTTGAACAGTCGATGATAGACCTC^GCATACCGAACCrc 

54AP_ _k _ a A A _ A A A - A A A _ P. A . I. A A .m_ A A . t . A . 

311 GACATGGGCTTACACX3AGCTGGCTGAGC\GCTGCAAACGACT 
CTGTACCCGAATGTCCTCGACCGACTCGTCGACGTTT^ 

75A P_ A A A A . A A . § . P. A _ 9. A A . A A A _ A A A _ A 

373 GGCAGCTGCAGCC^GTGTCCCTGCTCAGAGTO 

CCGTCGACGTCGGTCACAGGGACGAGTCTCATGTCGGTCT'lt j lCCT 

95A . a _ a_ _a_ _ a _ a A . P. A -9 . A A A _ A A .9 . A A A . P. _Q . . 

43 5 ACAGGCAAGCACTCATTGCCAGGGTCACAGAAGTG 
TGTCCGTTCCTGAGTAACGGTCCCAGTXTICT^ 

ii6A H . A . P. A A A - A A A_ A A . A A . A A A _ A A A - A A . 

497 AGTGTXSCTGACTGAAGGACAGTACCAGGCAGTrCGTGCAGAGAC^ 
TCACACGACTGACTTCCTGTCATGGTCC 

137A s_ A A _ A A . 9. P. A . 9. A A . P. A A . A A A _ 9. A .1 - A 

559 GAGGAAGCTCTTCAGCTTTGTTCCATCCTGGAACC 

CTCCTTCGAGAAGTCGAAACAAGGTAGGACCTTGGACTGGACG 

15 7 A _ A A A A_ A A . Y_ A A . A A A _ A A A . P. A A - A A _ . 

621 CCTTGAAGGAAATACATCCCTACTTGGTGATGGACCT 

GGAACTTCCTTTATGTAGGGATCAACCACTACCTGGACCT^ 
,_178^A LKEIHPYLV MDLEQS 



683 AGCTACATTATCTAGCTCCTGACTTTCT 

TCGATGTAATAGATCGAGGACTGAAACATA' l U 'lC ri'AAAAA C 1 ' 1 ' I 1 1 ' 1 ' IG ' l ' 1 ' A AACATAAACA 



745 GTTTAAAAAAAAAAAAAAAAAAAGGGCGGCCGC 

- CAAA i ' i ' ri ' rn ' i ' iHTri ' rri ' i ' ri cccGCCGGCG 
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1 CGCGTCCGGCTGCAGCGGGGTGAG^ 

GCGa\GGCCGAOnCGCCCCACICGCCGCCGTC 

!► M G 



61 GCGCGCGCGACGCCATCTTGGATGC^^ 

CGCGCGCGCTGCGGTAGGACCTACGCGACCTCTTGGACTGGCGG 
3» R A R D A I L D A L E N L T A E E L K K 



121 TCAAGCTGAAGCTGCTGTCGGTGCCGCTGCGC 
AGTTCGACITCCACGACAG^ 
23 ►F K L K L L S V P L , R E G Y G R I P R G 



181 CGCTOCTGTCCATGGACGCC^^ 

GCGACGACAGGTACCTGCGGAACCTGGAGTGGCTCT^ 
43^A L L S M D A L D L T -D K- L V . S F Y L E . 



241 CCTACGGCGCCGAGCTCACCGCTAACGTGCTGCGCGACATGC^^ - - 

GGATGCCGCGGCTCGAGTGGCGATTGCACGACGCGCTGTACCCGGACGTC 
63 ► T Y G A ELTA NVL RDMGL QEM A 



3 01 C^3CAGCTGCAGGCGGCCACGCACCAGGGCTCTGGAGCCGCGCCAGC 

CCGTCGACGTCCGCCGGTGCGTGGTCCCGAGACCTCGGCGCGGTCGACCCTAGGTCCGGG 
83^ GQL QA A T H QGS G A - A P A G I QA 



361 CTCCTCAGTCGGCAGCCAAGCCAGGCCTGCACTTTATAGACCAGCAC 

GAGGAGTCAGCCGTCGGTTCGGTCCGGACGTGAAATATCTGGTCGTGGCC 
103^P PQSAA KPGL HF I DQHRAAL 



421 TCGCGAGGGTCACAAACGTTGAGTGGCTGCTGGATGCTCTGTACGGG 

AGCGCTCCCAGTCTTTGCAACTCACCGACGACCTACGAGACATGCCCTTC 
123^1 A RVTNVEW L L DA LYGKVL T 



481 ATGAGCAGTACCAGGCAGTGCGGGCCGAGCCCACCA^CCC 

TACTCGTCATGGTCCGTCACGCCCGGCTCGGGT^ 
143^D EQYQ A V R A E PTNPS KMRKL 

54 1 TCAGTTTCACACCAGCCTGGAACTGGACCTGCAAGG 

AGTCAAAGTGTGGTCGGACCTTGACCTGGACGTTCCTGAACGAGGA 
163>F S F T P A W N W T C K D L L L - Q A L R 

601 AGTCCCAGTCCTACCTGGl^AGGACCTGGAGC^ 

TCAGGGTCAGGATGGACCACCTCCTGGACCTCGCCICGAC 
183^E SQSYLVEDLERS 

661 TCCGGTCAGCCCCTCGCAATCCCACCA^ 

AGGCCAGTCGGGGACCGTTAGGGTGGTTTAGTAGGACTTAGACTAG^ 

721 ATACGAAAAGCCAGCTTGAA 
TATGCTTTTCGGTCGAACTT 
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ALIGN calculates a global alignment of two sequences 
version 2 . OuPlease cite: Myers and Miller, CABIOS (1989) 

> hCARD5-DNA 740 a a vs. 

> mCARDS - DNA 763 aa 
scoring matrix: paml20 .mat , gap penalties: -12 /-4 

68.2% identity; Global alignment score: 2377 

10 20 .30. 
inputs C GCGTCCGGCTGCAG-CGGGGTG AGCG -GCGGCAGC GGC 

CCACGCGTCCGGCAGCAGGCAGGCTGCAGCAGGCGAGCAGCAGCAAGAGTAAAAGGTGAC 
10 20 . 30 40 : 50 60 

40 50 60 70 80 90. 

inpu t s CGGGGAT CCTGGAGCCATGGGGCGCGCGCGCGACGCCATCCTGGATGCGCTGGA 



CGCGGCTGCCCACCCCAGAGCCATGGGGCGGGCACGAGATGCCATCCTGGACGCTCTTGA 
^ 70 80 - . 90 - 100 110 120 

100 110 120 130 140 150 

inputs GAACCTGACCGCCGAGGAGCTCAAGAAGTTCAAGCTGAAGCTGCTGTCGGTGCCGCTGCG 



AAACTTGTCAGGGGATGAACTCAAAAAGTTCAAGATGAAGCTGCTGACAGTGCAACTGCG 
130 140 150 160 170 180 

160 170 180 190 200 210 

inputs CGAGGGCTACGGGCGCATCCCGCGGGGCGCGCTGCTGTCCATGGACGCCTTGGACCTCAC 

AGAAGGCTATGGGCGCATCCCACGCGGGGCCCTGCTGCAGATGGACGCCATAGATCTCAC 
190 200 210 220 230 240 

220 230 240 250 260 270 

i npu t S CGACAAGCTGGTCAGCTTCTACCTGGAGACCTACGGCGCCGAGCTCACCGCTAAC-GTGC 

TGACAAACTTGTCAGCTACTATCTGGAGTCGTATGGCTTGGAGCTCAC-AATGACTGTGC 
250 260 270 280 290 

280 290 300 310 320 330 

i npu t S TGCGCG ACATGGGCCTGCAGG AG ATGGCCGGGCAGCTGC AGGCGGCC ACGCACCAGGGCT 



TTAGAGACATGGGCTTACAGGAGCTGGCTG AGCAGCTGCAAACG ACTAAAGA - - AGAG - T 
300 310 320 330 340 350 

340 350 360 370 380 390 

inputs CTGGAGCCGCGCCAGCTGGGATCCAGGCCCCTCCTCAGTCGGCAGCCAAGCCAGGCCTGC 



CTGGAGCTGTGGCAGCTGCAGCCAGTGTCCCTGCTCAGAGTACAGCCAGAACAGG AC 

360 370 380 390 400 410 

400 41.0 420 430 440 450 
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inputs ACTTTATAGACCAGCACCGGGCTGCGCTTATCGCGAGGGTCACAAACGTTGAGTGGCTGC 
ACTTTGTC^ACCAGCACAGGCAAGCACTCATTGCCAGGGTCACAGAAG^ 



420 430 440 450 



TGC 

460 470 



460 470 480 490 500 



510 



inputs TGGATGCTCTGTACGGGAAGGTCCTGACGGATGAGCAGTACCAGGCAGTGCGGGCCGAGC 

TGGATGCTTTGCATGGCAGTGTGCTGACTGAAGGACAGTACCAGGCAGTTCGTGCAGAGA 
480 4^0 500 510 520 530 

5 20 530 540 550 560 570 

inputs CCACCAACCCAAGCAAGATGCGGAAGCTCTTCAGTTTCACACCAGCCTGGAACTGGACCT 



CCACCAGCCAAGACAAGATGAGGAAGCTCTTCAGCTTTGTTCCATCCTGGAACCTGACCT 
540 ' 550 " 560 ' 570 ' ' 580 590 

580 * 590 600 610 620 630 

input s gcaaggacttgctcctccaggccctaagggagtcccagtcctacctggtggaggacctgg 

GCAAGGACTCCCTCCTCCAGGCCTTGAAGGAAATACATCCCTACTTCK3 

600 610 620 630 640 650 

640 650 660 670 680 

inputs AGCGGAGCTG AGGC - TCCTTCCCAGCAACACTCCGGTC - AGCCCCTGGCAAT -CCCAC - C 



AGCAGAGCTGAGGTATCTTTTCCAGCTACATT— ATCTAGCTCCTGACTTTGTATACAC 
660 670 680 690 700 710 

690 700 710 720 730 740 

inpu t s AAATCATCCTGAATCTGATCTTTTTATACACAATATACGAAAAGCCAGCTTGAA 

AATTTTTGAAAAAACAATT -TGTATTTGTGTTTAAAAAAAAAAAAAAAAAAAGG 
7 20 730 740 750 760 
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ALIGN calculates a global alignment of two sequences 
version 2.0uPlease cite: Myers and Miller, CABIOS (1989) 

> hCARDS -protein 195 aa V5 t 

> mCARDS -protein 193 aa 
scoring matrix: paml20.mat, gap penalties: -12/ -4 

71.8% identity; Global alignment score: 712 

10 20 30 40 50 60 

inputs MGRARDAILDALENLTAEEUCKFKLKLLSVPLREGYGRIPRGALLSMDALDLTDKLVSFY 

MGRARDAILDALENLSGDELKKFKMKLLTVQLREGYGRIPRGALLQMDAIDLTDKLVSYY 
10 20 30 40 50 60 

70 80 90" 100 : - 1 liO . - 120 

inputs LETYGAELTANVLRDMGLQEMAGQLQAATHQGSGAAPAGIQAPPQSAAKPGLHFIDQHRA 

LESYGLELTMTVLRDMGLQEI^EOLQT-TKEESGAVAAAASVPAQSTARTG-HFVDQHRQ 
70 80 90 100, 110 

130 140 150 160 170 180 

inputs ALIARVTNVEWLLDALYGKVLTDEQYQAVRAEPTO 

AL I ARVT EVDG VXiDALHG S VLT EGQ YQ AVRAETTS Q DKMRKLF S FV P SWNLTCKDS LLQ A 
120 130 140 150 160 170 

190 

inputs LRESQSYLVEDLERS 



LKEIHPYLVMDLEQS 
180 190 
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1 5::S~^!?^ CI ^ C ^ 



66 ^TErrrr:^ 1 ^^^^ 



nCT ^ CTA ^^GGACCAAACAT^^ 
J-t _M . A _ _S. _ E _ G_ _A_ _ S _ _S_ J 

------------ - K - -°- - ? . J. - K - - 1 . . L . .S. . V _ _L_Q _ Q_ _ D _ _P , D _ S_ I L D 

-.iV-.I.J-. J. .§_ - B -- R ..!:.J..S.i. E..E.Y..E..I.L.A.A..i..T _D__. 

326 CTCTGAAGAAMGCCGGAAGCT?ITAATTTItaTCC^ ... 

cac^tctt^^ 

- - =- J -'- P - A . K. K. _S . R_ _K_ A - i. A A . J . P. .* . K_ _G _ E _ D_ _S_ _ C. C. _C . _ 

AAGGAC3TrCACAGACAGAT^ACGGAAAGGTGTCAGTCGAAGGTGGAACCCAAAT 

- - Z 5 -\ f . A. . <« . C. _L_ _ S _ N_ A. _ F _ _P_ _q _ § _ A_ _S . T _ L_ _G _ L _ K_ _Q _ _ E _ _V _ 

..96>..P. R..Q.G_ T__G.§. V..V_E. _VJ5.R_ G.A.f. P.. P. F S L 

■ 521 GGACCATAACCCCAGAAATAGCAGAGCTCTCAGMGAGAAAGAATGCCCGGGTCIGGGAGCTI^CG 
. ii"P_ .I.1.J..P. E. J. .A _ E. _L_'_S . E. _E_ _K _ _E. .C . P _ G_ _L . G. A_ _P _ _ 

CTCAAGAAGTGGACGTrcCTTICGTCGGTGTCCCTIXKJCCTTCATGGAAGAAC^ 
. M 0 .*. - E . A . F . T_ _C . K _ E S. . S _ H_ _fl _ E _ P_ _E_ . V _ _P_ _S _ W_ E_ _N _ Q E 

1ilk ^CG^CCACGTGTCGTTCAG^^ 

i 6 . 1 .'. . V . R. _G . A _ Q _Q _ y _ J _A_ P R _S V K G V E Y E V P 
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716 CAAGTATCTCCCTCTTAAGCGACGGGCAGAGATACGAGGAGCCAGATGA 

GTTCATAGAGGGAGAATTCGCTGCCCGTCTCTATGCTC 
183 ► A S ' I SLLS D GQRYEEPD DSLYL E 



781 GAAGGGGAAGGT GAAGA GTCTCTTGGGTACCCTGAAGATC 

CTTCCCCTTCCACITCTGAGAGAACCCATGGGACTT^ 
205> E G E G E E S- L G - Y P - E D . V L -E " E G A- G D 0-- 



846 CCC ACAGTX^XTTTTGTATATGATAGTGA " 

GGGTGTCACGAAACATATACTATCACTCCTGCTTA 
226> PQ C F VY DS E E E C E Y E E NMG S S 



911 GTGAAGACAGTAGCTGCGACGACACTTCAGAGACCTG^ 

CACTTCTGTCATCC^CGCTGCTGTGAAGTCTCTG^ 
248^G E.OSSCDOTSETCVPLEGEKSA 



976 GAAGAAAGAAAAAGAGTGTTTXIAACAGGTCCTGTC 
CTTCTTTCTTTTTCTCACAAA.GTTGTGC^ 
, ?70_>_ E_ _E_ _R KR-VFQHVL SCL NM DRNRKL 



1041 TCTC C CA ^GTTCGTGAGGCAGTTTTCCATAGAC CGAGG ATGTGAGTGGACA 
AGAGGGTCTCAAGCACTCCGTCAAAAGGTATCTGGCTCCT 
291> LPE FVRQFS1 DR G CEWTPKTP 



1106 GAGACTTAGCTTGGAA' i ' 1 ' I V." 1 ' I GATGAAAGTTCAGGCTTTAGACTCGACAGC CAGAGATTCT ATC 
CTCTGAATCGAACCTTAAAGAACTACTTTCAAGTC CGAAATCTGAGCTGTCGGTCTCTAAGATAG 
_313>G DLAWNF LMKVQA L DSTA RDS I 



1171 CIXIAGGCCCGAGGTGGCGGGTGAAGAGAATXIAAGAATTGCCG^ 

GACTCCGGGCTCCACOXrCCAC^^ 
_ 335> L RP EV A GEENE EL PA G I EKLGI 



1236 - TGGAGACCCCCAAACCATCCATCCCCTGGATGTCCTCTGC 

ACCTCTGGGGGTTTGGTAGGTAGGGGACCTACAGGAGACGCGGACG 
356> G D P Q T I H P L D V L C A C M L C A D S 



1301 CCTTXX1AGCGTGAAGTCATGTCZAAACATGTACCAATGC 
GGAACGTCGCACTTCAGTACAGTTTGTACATT^TTACG 
■ 378> SL QREVMSNMY QC QFA L PL L L P ' 



1366 G ATGCTGAGAACAACAAAAACCTCTTAATGGT AGGGGCCATGA^ 
CTACGACTCTOTrrcrTTTTrcGAGAATTACC^ 
400> OA E NNK NL LMV GA MK DL KQPSA 
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<15 



Ms 



1431 ££*"!°F CK ^ 

. - . 521.% . 9. S_ _S . G_ G. _P_ _ L _ R. _E _ T_ _D _T _ F_ _L_ _G _ L_ _T_ _K _ M. .P. .V _ _ 

1496 TCTCTITICTKGACTftGGW 
...5V.»J..S _F_ _V_R_l_ A.9.C..S..P. S..K _S_ _R__I__V_ N T L L S 

1561 TSxTSx^^^ 0 ^ 1 ^^^ 

_ .^^TCC^ipGGTATGGGCTAAAAGGAGGTAGTCCTAGACAGAa 
...S&V.S. A.Q.9. A. P. Y.. P ._L. F__L .H.Q..D _l _S__V P V L P R 

1626 s^^' nTc ^ c ^ CT2G ^^ 

.-.2t 6 .*..9. J_.S_ _D_ _G__L_ _ V_E__V .J__W__C_F_ _P__D_ K_ I L K E S 

1691 ;^T5S^ 

. - - 2° 8 -\ p . _A__F__Q_K_ P..V.A..V..A_N..L__R_ G__D_L E S F W I 

1756 CAATTTGGTTrCCTGGTAGAAGTTTCCTCCGGTCTITICTITITC 
• - - 2 3 A\ 9. _ _ _ 5 _ f . A . V _ _E_ _V _ § . S_ _G _ L _ _F_ F_ _ F _ T_ _D _ C_ _L_ _G _ E K 

1821 

.--55.1/.. §. W. A A- J-.." _f _J-..G.E_ P_T_2__E._R_C._Y_F I L S 

1886 

.--§ 7 A\ P --Q.A. A-l. S. E..|_A__Q____ F__Q .R.J__L__K_ L_ _K P S Q 

GATGACAAAACCCTTCGACTCCTTCGACCCCTATCTTCCTTCTGATACCTCCGGGAAG^ 
. - - 595_»_ L_ .L_ _F _ W _E_ _A _ _E_ _E_ _A _ _G_ _D_ .5. A _K_ _T_ _M_ _E_ _ A _ _L_ _Q _ A _ _A_ _ 

2016 C^CAO^TMTCTCCT^ 

^C^CTTCATTACAGGAGAGCnGAGTCTACACACAGGG^ 
- -- S16_>__L_ Q..E._V__M__S_S__P__L_R_ C V S L E E M A S L A 



2081 uGGAGCTGGGCATTO^AGACCAAGAC^^ 
„ ok CCCTCGACCCGTMGTCCATCTGGTTCTCAAACTTCAATGAGTICTATAAGTI^ 
638* R ELGl OVDQDFEvTODl QVSPT 



BNSOOCtD <WO 010OB26A2 I 
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2146 AC\GTTCAAGGT GAAAA CC^ 

TGTC^ACTTCCACTTITGGTTGTT^ 
-660>_J_V EGENQQ-PCSQTKSPAESGAO 



2211 GGAGCCAATCAGAGAGCCAGGGGCTCAATGTGACGACAGTCAGAAT^ 

CCTCGGTTAGTCTCTCGGTCCCG^^ 
- §§A-*_ -§.. p 1 REPGA QCDDSQNA PVFHQ 



2276 CTCC^.GTATACATGCCTTATCCAGCA.CACCCATGGGCTTTGG^ 

GAGGTCATATGTACGGu^TAGGTCGTCTGGGTACCCGAAACCGGTAGT^ 
_ 703>T PVYMPYP A H P W A LA I K A G * G N F 



2341 TACCACGTTCCTTTGAATGCCCCCrrGG 



ATGGTGCAAGGAAACTTACGGGGGACCAATACCCGAGGGTGAAACCTAGTG # 
. Z 2 A>_ Y_ _H _V _ P_ _L_ A_ P _W_ L WAPTLDHSRGLS * * 



2406 TGGTTTTTTCCATTCCCATGCTAAACCCACTCAC 

. ACCAAGAAAGGTAAGGGTACGATTTGGGTGAGTGAGATTCCGGAAG^ f 
_74 6>GSFHSHAKPTHSK AFQANCHH I 



2471 CCCATCCCTCCCATGCTAAACCC ACTC ATGTGAATCCCTCTCATC^AACCCCACTCATGTGCAG 

GGGTAGGGAGGGTACGATTTGGGTGAGTACACTTAGGGA 
, 768/P HPSHA KPTHVNPSHA NPTHVQ 



253 6 CCrTGC\TGCTAAACCC\CTCACTCTAAGGCCTTCCAAGCT 

GGAACGTACGATTTGGGTGAGTGAGATTCCGGAAGGTTCGATT^ 
- 2%°?- f. CMLNPLTLRPSKLNPLPLRPL 



2601 TGGAGCCAAGCTAACTGCAATCATGCCCATCCCTCCCTTGCT 

ACCTCGGTTCGATTGACGTTAGTACGGGTAGGGAGGGAACGATTIXX3GAGACT 
-§y.^. KLTAIMPI PPLLNPLI Rl P 



2666 TGATGCTAACCCCACTCATGTGCAGCCTTC^ 

ACTACGATTGGGGTCAGTACACGTCGGAAGGGTACGATTTGGGCG^ 
- § 3 A*- L _ .M.LTPLMC S LPMLNPL I Y S L P K 



2731 C AAAA CCCTCCCCATCCCAATCTACTGCAGTTCAC 

GTTTTGGGAGGGGTAGGGTTAGATX^CGTCAAGTGCCG' l\Sl\J 1T1TJGAGTCGTCAGGGTCAGATT 
_ §55_>_ Q_N_PPHPNLLQF"TA HKPQQSOSK 



2796 GCCTTCTCAGC^GAGACCCAGTCAGCCTAAATCATTCCA 
CGGAAGACTCGTCTCTGGGTCAGTCGGATTTAGTAAa 
876> PS QQ RPSQPKS F QTK PSGA RA 

BNSDOCiD .WO 0t0Oi?6A? i * 
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2861 ^^CCAAGAGCAGGGAGACGTTAAAGAACATACTCTGGAGATCTOSGAA^ 
- S . _ _ _ J2 P " A G R R 



2926 TTGCTTAAGTATirmTTC^TATAGCM 

-^?^n^jAAq^ 

2991 ;^£^ 

JTCGTTTTGGGT^ 

3056 

-^^^T^rr^l^AGACCGACCC 

3121 AGGGCTCTGTCATCCCAACTCAGAGA^ 
JpCCGAGA(^GTAC^7rcAG^TC7^ 

3186 AC^TAACAACGACACAAGCAAGTCCTACATCAGCCATACTACAA^ 
JPT ^^T?E?TOT^^ 

3251 AAAAGGGTIGGATCTAACATCAAACCAAAGAAATC 
JTTEPC^OT^JTGT^ 



3316 ^CTCAGTGGGTTA 
JPTC^GT(^_CCC^T^TC^ 



3381 GTTGGGTGGGTOTAGGCTCAACA 
i^CCttCC^C^C^ 



3446 ACAGGTTATGGGACITGTCAAATCCATTAAATO^^ 
JPTCC^TAq^T^CAGTyTAGCT 



3511 ACATTAAGCTCCAGTGAGTCGC^^ 
^^J7? G J^?9^f^ 



3576 TGGGGTGTCnriX^TAG 
3P?9^^^£™TCGGG^ 



3641 -GTTCAOriTATCTGTGTrc^ 
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3706 GCCACTCAGCAGATACTCTGCAC^TTGGTACAGT^ 



3771 ATCTACCACATITATCCCTTACAGGCGGAAATAATGAATGATCAGCAAAA 



3836 TAACCTTGGTACTGATTTGTATATGTATCAT^ 
- -'- - ATTX^^CCATGACTA^C^TATO 



3901 AGGGGTTCTGATATATTAGTTTAATCGTTTGP^ 
. TCCCCM^CTATMAA'^^ 



3966 TAATTCAAAATATTGAGATGAATTTACAAAGGCT 
^JT^CT'I^AT^CT^ 



4031 GTAGACTCATACTGTTCTGAACATTTGGATAGC 
J^TCipACTAJ^C^<^^ 



4096 ATITGATTCAGGTATTTAACCAGAGCTCCTC^ 
J^AAACT^AGTCCATAAATTGGTC 



4161 GCTACATTATCTGAACATGTGGGAACACAACTGTGACCTTACAC^^ 



4226 ATCAAGGTTCAAGCCAGCAGCAC^TAGTGAGACCAGGTC^ 
JTAGTTCCAAGTTCGCT 



4291 AGGAAGATTTTAAAATTTGCCTCATTAAGAAATAAAGTA^ 
J£CTICTAA^T^A^CX3GACT 



4356 C ATV: - i » i UAACTT ATGACTGTTTAA' 1 T 1 ' I' 1 ' 1 UACTTAAAGTTTAA' 1 ' 1 ' 1 ' I ATTATTGT ATGCGTGT 
GT£G^ACTTGAATACK^CAAATTAAAAAACT 



4421 GTTGTATGTGTGTGCACATGTGTGCCACTGCATGTATGT^^ 
j^C^TACAC^CACGTGTACACACGGTGACGTACATACA 



4486 TC TGTTCi ' I' i V CTI^TIT AGC CCT ATGTGTTTT AC C CACTGAGCT AGGC C A CCT A CTC CT A T AAGTC 
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4616 :£^^!^ ATOAK ^^™^^ 

-^^CT^^TMTCOGra^ 

4681 7i*T£T°^ T ™ TOro ^^ 

- - - - ^T ATA ^*TTA^^ _ 

.... -^^T^TT^ac^^ _ 

4811 ™I7^T?^ A ^^^^ 

4876 AAGCG 



4941 ^g ^AGAGMGAAAA TGACTC^grCMG^^ 



5006 



5071 



5136 TGGGTTATAAAACTGCTCTITCTGCTCAAriTCTCCCCTGCTITrCT 

.... .*£^T4tnTSK5S«i^^ 
5201 ™!^ TAAAC ^ C ^ A ^^ 

-^^^ATT^greA^TCAAM U .1 1 LJl 1 M TlTrnTnu XCCC 
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CACGCGTCCGC CGGATCAGAGAGTGCT CCGAGCTGGGTTG C C C CACTGTG CTTCT ATCTG CACTCTCCAACACTAGGC 7 9 

ATCATTGACATGTTAAAGCTTAGCCAAATAGAATT G1 lLiri 'GTCATT Cl 1 1 1 1 1T AACTTTTACTTATTCATTAGGAT 1 S 8 

MAT E S TP S E 9 

GATTTCATAAT AT ATTTC CTGGTTTAGAGGAAACAGGAACA ATG GCT ACC GAG AGT ACT CCC TCA GAG 226 

* I ER ER KKLLE I LQHDPDS 129 
ATC ATA GAA AGA GAA AGA AAA AAG TTG CTT GAA ATC CTT CAA CAT GAT CCT GAT TCT ATC 286 

L. D T L...T S R R L I S E E E Y E T L E N 49 
TTA GAC ACG TTA ACT TCT CGG AGG CTG ATT TCT GAG GAA GAG TAT GAG ACT CTG GAG AAT 346 

VT D L L K KS R K L L I L V Q KKG E 69 
GTT ACA GAT CTC CTG AAG AAA AGT CGG AAG CTG TTA ATT TTG GTA CAG AAA AAG GGA GAG 406 

A T C Q'H F.L K C L F S " "t ' F P Q L A A 1-89 
GCG ACC TGT CAG CAT TTT CTC AAG TGT TTA TTT AGT ACT TTT CCA CAG TTA GCT GCC ATT 466 

CGL-RHEVLKHE.N. TVPPQSMG 109 
TGC GGC TTA AGG CAT GAA GTT TTA AAA CAT GAG AAT ACA GTA CCT CCT ' CAA TCT ATG GGG 526 

AS SNSEDAFSPGIKQPEAPE129 
GCA AGC AGT AAT TCA GAA GAT GCT TTT TCT CCT GGA ATA AAA CAG CCT GAA -GCC CCT GAG S86 

* TV F F S E K E H L 0 L E T S EF F R 149 
ATC ACA GTG TTC TTC AGT GAG AAG GAA CAC TTG GAT TTG GAA ACC TCT GAG TTT TTC AGG 646 

DKKTSYRETA-LSARKNEKEY 169 
GAC AAG AAA ACT AGT TAT AGG GAA ACA GCT TTG TCT GCC AGG AAG AAT GAG AAG GAA TAT 706 

D TPEVTLSYSVEKVGCEVPA189 
GAC ACA CCA GAA GTC ACA TTA TCA TAT TCA GTT GAG AAA GTT GGA TGT GAA GTT CCA GCA 766 

T' I T Y I KDGQRYEELDDSLYL 209 
ACT ATT ACA TAT ATA AAA GAT GGA CAG AGA TAT GAG GAG CTA GAT GAT TCT TTA TAC TTA 826 

GKEEYLGSVDTPEDAEATVE229 
GGA AAA GAG GAA TAT CTA GGA TCT GTT GAC ACC CCT GAA GAT GCA GAA GCC ACT GTG GAA 886 

EEVYDDPEHVGYDGEEDFEN 249 
GAG GAG GTT TAT GAT GAC CCA GAG CAC GTT GGA TAT GAT GGT GAA GAG GAC TTC GAG AAT 946 

SET TEFSG EEPSYEGSETSL 269 
TCA GAA ACC ACA GAG TTC TCT GGT GAA GAA CCA AGT TAT GAG GGA TCA GAA ACC AGC CTT 1006 

S L B E- E Q E K.- S I E E R K K , V F K - D V 289 
TCA TTG GAG GAG GAA CAG GAG AAA AGT ATA GAA GAA AGA AAA AAG GTG TTT AAA GAT GTC 1066 

L L C L N M D R S R K V L: P D F V K Q F 309 
CTG TTA TGT TTG AAC ATG GAT AGA AGC AGA AAG GTT CTG CCA GAT TTT GTT AAA CAA TTC 1126 

SLDRGCKWTPES PGDLAWN F 329 
TCC TTA GAT CGA GGA TGT AAG TGG ACC CCT GAG AGT CCA GGA GAC TTA GCC TGG AAT TTC 1186 

LM KVQAR DVTARDS I LSH KV 349 
CTG ATG AAA GTT CAA GCA CGA GAT GTG ACG GCT AGG GAT TCA ATC CTC AGT CAC AAG GTT 1246 

LD E DS KEDLLAGVENL E I R D 369 
CTG GAT GAA GAT AGC AAG GAG GAT TTG CTG GCT GGA GTG GAG AAT TTG GAA ATT CGA GAC 1306 
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LQR0VMSN MY0COP 1 r n, 

TTG CAA CGC CAA GTC ATG TCA AAC ATG TAT CAG TGC CAG TTT GCT CTT CCC CTC CTA CTG IS, 

CCA CAT CCA GAA AAC AAC AAA ACC ATC ™ ATC CTC GGG GCC ATG 1^ GAC ATT CTG AAG ill 

AAG CAG TCA ACA CAG TTT TCA GGG GGG CCT ACA GAG GAT ACA GAA AAG TTT CTG ACT CTC »« 

ATG Lc ™ CCT GTC ATC TCT TTT GTC CCT CTA gW TAC TGT AGC TTC TCT AAG TCC AGA "U 

ATC CTC Lc ACA CTT CTC AGC CCT GCC CAG TTG AAA TTA CAC AAA ATC TTT CTT CAT CAA 111* 

GAT TTG CCT CTT TTG GTG CTT CCC CCC CAA ATC TCT OAT GGC CTG GTT GAG ATA IcA L 

CFPDSDDRK 2NPFFO»f o« R , 

TGT TTT CCT GAT AGC GAT GAT AGA AAG GAA AAC CCC TTT TTC CAA AAG CCT GTT GCT CTG "1 

CCT AAT CTC CCT L L CTA ^ L ^ L L CAC ^ L L TTC ATC L GTT "! S 

tct ^^^m^^InL^^LLiLw^^^:™^ 

TTT TTA GGA GAG GCT GCC ATT GAA AGA TGC TAC TTT GTT CTC ACT TCC CAA GCC AGG GAG 1966 
ACT GAA GAG GCT CAA ATT TTT CAG AGG ATA CTG AAC TTG AAG CCA GCA CAG CTA CTG TTT tile 
TGG GAG AGG GGA GAT GCT GGG GAT AGA AGG AAG AAC ATG GAG GGC CTT CAA CCT GCC CTC 2086 
CAG GAA GTG ATG TTC TCT TCT TGC CTC AGA TGT GTG TCT GTG GAG GAT ATG GCC GCC CTG 
GCC AGG GAG CTG GGG ATT CAG CTA GAT GAA GAC TTT GAA AAC ACT CAG AGA ATT CAA GTT I", 

TCC TCT GGA GAA AAC ATG GCT GGG ACA GCT GAA GCT GAG GCT CAG CAA AGA CAC ACT CAO HI 

~. KSSSK SQALMP I0EPGTQC709 
CTA AAA AGC TCA TCT AAA AGC CAG GCT CTA ATG CCA ATT CAA GAG CCT GGG ACT CAA TGT 2326 

ELS ° NL0NL1 <'GTPVFRPVLE729 
GAG CTC AGC CAG AAT CTT CAG AAT CTC TAT GCT ACC CCA CTA TTC AGG CCT GTT CTA GAG 2386 

NSWLF, PTRIGGNFNHVSLKA749 
AAC TCC TGG CTC TTT CCA ACC AGA ATT GGA GCT AAC TTT AAC CAT CTT TCC TTG AAA GCC 2446 

SW VMGRPF GS EQRPKWFHPL 7S9 
TCC TGG GTT ATG GGC CGC CCC TTT GGG TCA GAG CAG AGG CCT AAG TGG TTC CAT CCT TTG 2506 
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p FQNAGAQGRGKSFGlQSFH789 
CCT TTT CAG AAT GCA GGG GCC CAG GGC CGA GGT AAA AGT TTT GGT ATT CAA TCC TTC CAT 2566 

p Q I FYS GERFMKFSRVARGC 809 
. „ CCC CAG ATA TTT TAT TCA GGT GAA AGA TTC ATG AAA TTT TCC AGA GTT GCT CGG GGA TGT 2626 

HSNGT F G RLP RP I C Q H V Q A C 829 
CAC TCG AAT GGA ACA TTT GGG AGA CTG CCA AGA CCC ATT TGT CAG CAT GTA CAG GCC TGC 2686 

PERp QM .MGTLERSRAVASKl849 
CCT GAG AGA CCA CAA ATG ATG GGA ACT CTT GAA AGG TCT AGG GCA GTA GCC TCC AAG ATA 2746 

GHS YS L D SO.PARAVG K PWP0 869 
GGT CAC TCC TAT TCC CTG GAT TCA CAG CCA GCA AGA GCA GTA GGG AAG CCA TGG CCT CAG 2806 

Q A C T R V T E L T E A T G K L I R T S 869 
CAA GCT TGC ACC AGG GTA ACA GAG TTA ACT GAA GCA ACT GGA AAA CTG ATA AGA ACA TCC 2866 

HI GKP .HPQSFQPAA. ATQKLR909 
CAT ATT GGA AAG CCT CAC CCT CAG TCC TTT CAA CCA GCA GCA GCC ACA CAA AAA CTA AGA 2926 

PA SQOGVQMKTQGGASNPAL 929 
CCT GCT TCT CAG CAA GGA GTC CAG ATG AAG ACA CAA GGT GGG GCT TCA AAT CCA GCT CTC 2986 

Q I GSHPMCKSSQFKSDQSM P949 
CAA ATA GGG TCC CAT CCC ATG TGC AAG AGC TCT CAG TTC AAA TCC GAT CAG TCC AAC CCA 3046 

S TV KHSQPKP .FHSVPSQPKS969 
TCC ACA GTC AAA CAC TCC CAG CCT AAA CCC TTC CAT TCT GTG CCC TCT CAA CCT AAA TCC 3106 

SQTKSCQSQPSQTKPS PCKS 989 
TCT CAG ACA AAA TCC TGT CAG TCC CAG CCC TCC CAA ACT AAA CCT TCT CCA TGC AAA TCT 3166 

T Q P K P S Q PWPPQSKPSQ PRP 1009 
ACT CAG CCT AAG CCA AGC CAG CCC TGG CCT CCC CAG TCT AAG CCT TCT CAG CCC AGA CCC 3226 

p Q p K SSSTNPSQA KAHHSKA 1029 
CCT CAA CCT AAG TCA TCC TCA ACC AAT CCT TCA CAA GCT AAG GCA CAC CAC TCA AAA GCA 3286 

GQKRGGKH* 1038 
GGG CAG AAG AGG GGA GGG AAG CAT TAA 3313 

AGAGCTAACTCCAGAGATCTATAAAGCATATCCTTTACCCAGGCCATTCCTATCATATA 33 92 

GAAAGTAAAAQACTACTCTCATTAGCATGTAAAACAAAGAAAGATATACATGACCGAA 1 1 IblllblllU 3471 

TTTGAGACAQAGTTTCACTCTTGTTGCCCAGGCTGGAGTGCAAT 3 5 SO 

GGCTTAAAGTOATTCTCCTGCCTCAGCCTCTCGAGTAGOT 3629 

TATTTTTAGTAGAGGCAGGGTTTCTCCATGTTGGTCAGGCTGGTCTTGA 3708 

GGCCTCTCAAAGTGTTGGGATTACGTGTGTAAGCCACAGTGCCCAGCCCGAATTGGATATCT 3787 

GTTATATCCCTAACCAAGAAGAAAAATATGAAAATAATTAAGACT 3 866 

GGTATTATTAGATAATGTATAACTTGCACCCAGGGAATGGGGGTCTATGAGACAACCCC 394 5 

AGGGTCTCTAATTG CAAAGTG ACTGTACAAT AGGACGAAAGTTG CCTCTGTGT CTGAGAAAGT ATCTTAGTTGTTGG^ 4024 
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CCTCC^GAGGTATCTTTGTCA^ 4 

AAAT CAACTAGCTTCTGCTAATAGCCCCAATTTGCTTGAATGGGAAAACTCTCT 4 
ATGAATT^CAACCAATAAAA^^ . 4; 
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CONSENSUS *->maederrlLkkiirvrliesLgldvLdelIxivLlekdvlnlkeeEkik 
+++ * r++l+e+L+ d d +Ld L + +e E 

CARD 6 5 STP--SEIIERERKKLLEILQHD-PDSILDTLTSRRLISEEEYErLE 48 

CONSENSUS ragakledDKarelvdslqrrgsqafdafidaledTgqsyLAdvLel<-* 
♦ 1 ♦ r 1++ +q++g- ♦ ++ f + +1++ LA++ +1 

CARD6 49 NVTDLLKK--SRKLLILVQKKGEATCQHFLKCLFS-TFPQLAAICGL 92 
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Claims Nos.: 
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3. Claims Nos.: 
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This International Searching Authority found multiple (groups of) 
inventions in this international application, as follows: 

1. Claims: 1-22 partially 

Isolated nucleic acid molecules as defined by SEQID 1,3 and 
5 he .^rr e ! pon ? in9 iso1atec * Polypeptide sequence as defined 
by SEQID 2; the recombinant expression of the same in host 
cells; an antibody binding to said polypeptide and a method 
for producing said polypeptide, a method for detecting the 
presence of said polypeptide in a sample by contacting the 
sample with the antibody or by hybridization with a probe or 
primer; a. method for identifying a. compound that- binds to 
said polypeptide or modulates, the activity of said 
polypeptide and a method of modulating the activity of said 
polypetide by contacting the polypeptide with said compound. 

2. Claims: 1-22 partially 

Isolated nucleic acid molecules as defined by SEQID 
7,9,25,27,38,40 and the corresponding isolated polypeptide 
sequences as defined by SEQID 8,26,39,41; the recombinant 
expression of the same in host cells; an antibody binding to 
said polypeptide and a method for producing said 
polypeptide, a method for detecting the presence of said 
polypeptide in a sample by contacting the sample with the 
antibody or by hybridization with a probe or primer* a 
method for identifying a compound that binds to said 
polypeptide or modulates the activity of said polypeptide 
and a method of modulating the activity of said polypetide 
by contacting the polypeptide with said compound. 

3. Claims: 1-22 partially 

Isolated nucleic acid molecule as defined by SEQID 42 and 
the corresponding isolated polypeptide sequence as defined 
by SEQID 43; the recombinant expression of the same in host 
cells; an antibody binding to said polypeptide and a method 
for producing said polypeptide, a method for detecting the 
presence of said polypeptide in a sample by contacting the 
sample with the antibody or by hybridization with a probe or 
primer; a method for identifying a compound that binds to 
said polypeptide or modulates the activity of said 
Polypeptide and a method of modulating the activity of said 
polypetide by contacting the polypeptide with said compound. 

4. Claims: 1-22 partially 

Isolated nucleic acid molecules as defined by SEQID 
48,50,60,62 and the corresponding isolated polypeptide 
sequences as- defined by SEQID 49,61 ; the recombinant 
expression of the same in host cells; an antibody binding to 
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said polypeptide and a method for producing said 
polypeptide, a method for detecting the presence of said 
polypeptide in a sample by contacting the sample with the 
antibody or by hybridization with a probe or primer; a 
method for identifying a compound that binds to said 
polypeptide or modulates the activity of said polypeptide 
and a method of modulating the activity of said polypetide 
by contacting the polypeptide with said compound. 



5. Claims: 1-22 partially 

Isolated nucleic acid molecules as defined by SEQID 
54,56,51,53 and the corresponding isolated polypeptide 
sequences as defined by SEQID 55,52; the recombinant 
expression of the same in host cells; an antibody binding to 
said polypeptide and a method for producing said 
polypeptide, a method for detecting the presence of said 
polypeptide in a sample by contacting the sample with the 
antibody or by hybridization with a probe or primer; a 
method for identifying a compound that binds to said 
polypeptide or modulates the activity of said polypeptide 
and a method of modulating the activity of said polypetide 
by contacting the polypeptide with said compound. 
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